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FELINE IMMUNODEFICIENCY VIRUS CLONE JSY3 

This invention was made with government support 
under Public Health Service grant NOl AI 35515 from the 
NIAIDS-DAIDS. The government may have certain rights to 
this invention. 

T7-jp>1ri of Hi ft Inve ntion 
This invention concerns a Feline 
immunodeficiency Virus molecular clone which is highly 
infectious in vivo and which produces immunodeficiency in 
infected subjects. 

parVf frmind of H^ Invention 
Feline immunodeficiency virus (FIV), a 
Antivirus of cats, is associated with feline acquired 
immunodeficiency syndrome (AIDS) . See N. Pedersen et al . , 
Science 235: 790 (1987). Disorders associated with FIV 
infection include chronic gingivitis/stomatitis, chronic 
upper respiratory infections, chronic enteritis, and 
recurrent ocular disease. See R. English et al.. J. Am. 
Vet. Med. Assoc . 196: 1116 (1990); N. Pedersen et al., Vet. 
Itanunol. unmunopathol. 21: 111 (1989); J. Yamamoto et al., 
J. Am. Vet. Med. Assoc. 194: 213 (1989). What is known to 
date of the pathogenesis of FIV infection suggests that it 
is a valuable animal model for other retroviral diseases, 
such as human immunodeficiency virus-1 (HIV-1) infection. 
HIV-1 and FIV belong to the lentivirus subfamily of 
retroviruses, have similar morphology, protein composition, 
and Mg 2 --dependency of their reverse transcriptases (RT) . 
See N Pedersen et al . , Science 235: 790 (1987); N. 
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Pedersen et al . , Vet. Immunol. Immunopathol . 21:111 (19B9) . 
They both display tropism for T lymphocytes and monocytes 
and are capable of inducing these cells to form syncytia. 
See D. Brunner and N. Pedersen, J. Virol. 63: 5483 (1989); 
M. Gardner and P. Luciw, FASEB Journal 3: 2593 (1989). 
HIV-1 displays a particular tropism for CD4 + lymphocytes, 
which leads to their gradual depletion and an inversion of 
the CD4 + :CD8 + ratio. See A. Dalgleish et al . , Nature 312: 
763 (1984) . The pathogenesis of HIV-1 infection has been 
attributed to virus -induced reduction of CD4* lymphocyte 
numbers and functions, resulting in decreased immune 
responsiveness and subsequent severe secondary infections. 
See M. McChesney and M. Oldstone, Ad. Immunol. 45: 335 
(1989) . 

Yamamoto et al. studied the early events in the 
pathogenesis of FIV in kittens. See J. Yamamoto et al . , Am. 
J*. Vet. Res. 49: 1246 (1988) . These kittens developed an 
acute infection syndrome similar to that seen in HIV-1, 
including low grade fever and transient generalized 
0 lymphadenopathy . More recent studies by Ackley et al . , J. 
Virol. 64: 5652 (1990), utilized monoclonal antibodies 
directed against feline CD4 + and CD8* homologues and Pan T 
cells to analyze lymphocyte profiles in SPF cats 
experimentally infected with FIV. These authors reported 
5 that a significant inversion of the CD4 + :CD8 + ratios 
occurred only in cats infected for 18 months or more. The 
inversion was associated with a decrease in absolute number 
of CD4* cells and an increase in CD8* cells. 

A panel of monoclonal antibodies specific for feline T 
0 cell subsets (M. Tompkins et al . , Vet. Immunol. 
Immunopathol. 26: 305 (1990)) has been used to analyze T 
cell numbers and profiles in cats naturally infected with 
FIV. See C. Novotney et al . , AIDS 4: 1213 (1990) . Similar 
to the observation of Ackley et al . supra, cats naturally 
,5 infected with FIV have an inverted CD4*:CD8* ratio 
characterized by a selective reduction in CD4* cells. 
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Summarv of the Invention 
A first aspect of the present invention is an isolated 
feline immunodeficiency virus (FIV) having all of the 
identifying characteristics of FIV clone JSY3 . 
5 A further aspect of the present invention is an 

isolated feline immunodeficiency virus (FIV) whose proviral 
DNA comprises a DNA sequence selected from SEQ ID N0:1 and 
sequences which vary from SEQ ID N0:1 due to the degeneracy 
of the genetic code. 
10 A further aspect of the present invention is a 

biologically pure culture of host cells containing feline 
immunodeficiency virus as described above. 

A further aspect of the present invention is isolated 
DNA comprising a DNA sequence selected from SEQ ID NO:l and 
15 sequences which vary from SEQ ID N0:1 due to the degeneracy 
of the genetic code; vectors containing such DNA; and host 
cells containing and capable of expressing such vectors. 

A further aspect of the present invention is isolated 
DNA comprising a DNA sequence selected from (a) SEQ ID 
20 N0:4, SEQ ID NO:5, SEQ ID N0:7, SEQ ID NO : 9 , SEQ ID NO:ll, 
SEQ ID NO:13, SEQ ID N0:14, SEQ ID NO:16, SEQ ID N0:18, SEQ 
ID NO: 19, and (b) sequences which vary from those of (a) 
above due to the degeneracy of the genetic code; vectors 
containing such DNA; and host cells containing and capable 
25 of expressing such vectors. 

A further aspect, of the present invention is a 
polypeptide having a sequence selected from SEQ ID NO: 2, 
SEQ ID N0:3, SEQ ID N0:6, SEQ ID N0:8, SEQ ID NO:10, SEQ ID 
NO:12, SEQ ID NO:15, SEQ ID NO:17, and SEQ ID NO:20. 
30 A further aspect of the present invention is a specific 

pathogen free (SPF) cat infected with feline 
immunodeficiency virus clone JSY3 . 

Brief Description of the Drawings 
Figure 1A - 10 provide the DNA sequence of the FIV-NCSUi 
3 5 insert of the lambda clone. The first three nucleotides 
are part of the lambda vector DNA sequence; the FIV 
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proviral DNA sequence begins with the fourth nucleotide of 
Figure 1A. The gag region (and the pl5, p25, p24a and plO 
regions therein) , the pol region (and two open reading 
frames (orf) therein, and the env region (and the 
5 transmembrane (TM) protein therein) are indicated. 

Figure 2A - 2H aligns the group specific antigen (gag) 
open reading frame of the FIV NCSU X JSY3 molecular clone 
with those of six known FIV strains: FIV PPR, FIV Zl, FIV 
CG, FIV 14, FIV TM1 and FIV TM2 . 

10 Figure 3A - 30 aligns the envelope protein sequence of 

FIV NCSt^ JSY3 molecular clone with those of five known FIV 
strains: FIV 14, FIV Zl, FIV CG, -FIV 19k, and FIV PPR. 

Figure 4 is a schematic of the strategy used for the 
molecular cloning of the FIV JSY3 full-length genome, 

15 beginning with total cellular DNA from FCD4E cells directly 
infected with FIV-NCSU^ 

Detailed Description of the Invention 
A major limitation of the FIV model for the study of 
retroviral infection is the unavailability of molecular 
20 clones that retain the pathogenic characteristics of the 
wild-type viruses. Genetically homogeneous molecular 
clones of FIV that retain the biological and disease- 
causing properties of the pathogenic wild-type populations 
are useful for understanding the molecular basis for 
25, determinants of FIV pathogenesis, treatment of FIV, and the 
re levance of FIV to other retroviral infections. 

The FIV molecular clones FIV- 14 (Olmsted et al . , PNAS 
USA 86:2448 (1989)), FIV-pF34* of FIV- Petaluma (Sparger et 
al., Virology 205:546 (1994)), FIV-pPPR of FIV-PPR (Sparger 
30 et al., Virology 205:546 (1994)), pFTM191CG of FIV-TM1 
(Miyazawa et al., J. Virol. 65:1572 (1991)), and 19K1 of 
FIV Amsterdam-19 (Siebelink et al . , J . Virol. 66:1091 
(1992)), have been reported to be infectious in vivo as 
determined by seroconversion, cell-associated virus, and 
35 the presence of FIV provirus. No clone has been reported 
as pathogenic to the extent that it causes immunodeficiency 
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and increased susceptibility to secondary opportunistic 
infections . 

An isolate of FIV (FIV-NCSUJ that is pathogenic in 
vivo, as measured by a severe loss of CD4+ cells and 
5 development of secondary infections, severe wasting, 
neurological disease, and B-cell lymphomas, has been 
described recently (English et al . , J. Infect. Dis. 
170:543 (1994)). Davidson et al. (Am. J". Pathol. 143:1486 
(1993)) were able to demonstrate that FIV-NCSUj causes a 
10 relatively early and profound state of immunodeficiency, as 
measured by loss of resistance to challenge with a 
Toxoplasma gondii strain with a- low level of virulence. 
This dual FIV-T. gondii infection provides a model with 
which to determine the ability of FIV isolates as well as 
15 molecular clones of FIV to cause immunodeficiency. 

A full length FIV-NCSUj. genome (JSY3) was cloned 
directly from FIV-NCSUi infected feline CD4+ lymphocyte 
(FCD4E) genomic DNA and identified by polymerase chain 
reaction (PCR) amplification with 5 ' -LTR, gag, env, 3'-LTR 
20 primer sets. Supernatant collected from FCD4E cells 
cocultured with JSY3-transf ected Crandell feline kidney 
(CrFK) cells was used as inoculum. Cell -free JSY3 virus 
was cytopathogenic for FCD4E lymphocytes, but did not 
infect CrFK cells in vitro. To determine in vivo 
25 infectivity and pathogenesis, 6 young adult SPF cats were 
inoculated with cell-free JSY3 virus. Provirus was 
detected at 2 wk post-infection, and was still detectable 
at 25 weeks post infection as determined by gag region 
PCR/Southern blot analysis of 'peripheral blood mononuclear 
30 cell (PBMC) lysates. Infectious virus was recovered from 
PBMC at six weeks and 25 weeks post infection, and antibody 
response to FIV was detected by four weeks post infection, 
in the acute phase of infection, JSY3 provirus was found 
only in the CD4+ lymphocyte subset; however, by 14 weeks 
35 post invention the greatest provirus burden was detected in 
B lymphocytes. All six cats were panleukopenic at two 
weeks post infection, CD4+:CD8+ ratios were inverted by six 
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weeks post -infection, and 5/6 cats developed 
lymphadenopathy by ten weeks post infection. To determine 
if the JSY3 molecular clone caused immunodeficiency similar 
to the parent wild-type FIV-NCSU^ the cats were challenged 
5 with the low virulence ME49 strain of Toxoplasma gondii 
(T. gondii) at 29 weeks post infection. Five of six cats 
developed acute respiratory distress and required 
euthanasia. Histopathologic examination of the severely 
affected cats revealed generalized inflammatory reactions 
10 and the presence of T. gondii tachyzoites in multiple 
tissues. None of the six age- and sex-matched SPF cats 
inoculated with only T. gondii developed clinical disease. 
These results indicate that the pathogenesis of the 
molecularly cloned NCSUj. JSY3 isolate is similar to the 
15 wild-type FIV-NCSUi and induces immunodeficiency in cats. 

The JSY3 molecular clone retains the essential in vitro 
and in vivo biological characteristics of the parent virus. 
This clone was obtained from an EMBL3 lambda phage library 
made from FCD4E cells, and the intact genomic structure was 
20 confirmed by PCR comparison with the FIV-pPPR molecular 
clone. The JSY3 molecular clone recovered was highly 
infectious for PBMCs and FCD4E cells but failed to infect 
CrFK cells, thus retaining the tropism of the parent FIV- 
NCSU X virus. Miyazawa et al . (Miyazawa et al., J . Virol. 
25 65:1572 (1991)) and Siebelink et al . (Siebelink et al., J. 

Virol. 66:1091 (1992)) reported that CD4+ lymphoblastoid 
cell line MYA-1 cell -derived or bone marrow-derived 
molecular clones of FIV recovered from transfected CrFK 
cells failed to reinfect CrFK but retained their tropism 
3 0 for PBMC and CD4+ cell cultures. Similarly, the PBMC- 
derived molecular clone FIV-pPPR replicated efficiently in 
PBMCs but did not infect adherent cells such as CrFK or 
G355-5 cells (Phillips et al . , J. Virol. 64:4605 (1990)), 
whereas the FIV-p34 clone, derived from the CrFK-adapted 
3 5 Petaluma isolate, replicated efficiently in feline adherent 
cells, including CrFK cells, but inefficiently in PBMCs 
(Sparger et al., Virology 205 : 546 (1994)). 
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The JSY3 clone retains the in vivo biological 
characteristics of the parent NCSU, virus. Both viruses 
caused a significant inversion of the CD4+/CD8+ ratio by 
six weeks post infection. As reported previously for a 
5 number of biological isolates of FIV (Ackley et al . , J. 
Virol. 64:5652 (1990); Torten et al., J. Virol. 65:2225 
(1991); Willett et al . , Immunology, 78:1 (1993)), including 
the NCSU, isolate (English et al., J. Infect. Dis. 170:543 
(1994); Tompkins et al., J. Am. Vet. Med. Assoc. 199:1311 
10 (199D), the inverted CD4+/CD8+ ratio caused by the JSY3 
clone was the result of a loss of CD4+ lymphocytes and an 
increase in CD8+ lymphocytes. Consistent with the NCSU X 
biological isolate, the JSY3 molecular clone caused a 
strong antibody response to gag and env antigens, and PBMCs 
15 had a high burden of FIV provirus during the acute-stage 
infection. 

The JSY3 clone exhibited a pattern similar to the 
parent FIV-NCSU, (English et al., J. Virol. 67:5175 (1993)) 
of high provirus burden in CD4+ cells during acute-stage 
20 infection, followed by a gradual shift to a panlymphotropic 
pattern during the transition from the acute to the 
asymptomatic stage of infection. 

Derivation of molecular clones of viruses from in vitro 
culture systems poses the risk of selection of some viral 
25 genotypes over others (see Dahl et al., J. Virol. 61:1602 
(1987; Evans et al . , J. Immunol. 138:3415 (1987); Meyerhans 
et al., Cell 58:901 (1989)), or introduction of 
modifications in cultured virus, (see Hirsch et al., Nature 
341-767 (1989); Kodama et al.J J- Virol. 63:4709 (1989)). 
30 For FIV, Sparger et al. (Sparger et al., Virology 205:546 
(1994) ) reported that the P F34 clone derived from the CrFK- 
adapted Petaluma isolate is less pathogenic than the parent 
Petaluma virus isolated from PBMCs. In contrast the FIV- 
pPPR molecular clone derived from PPR-infected PBMCs and 
35 the biological parent PPR isolate show similar 
pathogenicities, including virus burden in PBMCs and 
reduced CD4+/CD8+ ratios (Sparger et al., Virology 205:546 
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(1994)). The JSY3 molecular clone also retains the 
essential biological characteristics of the parent isolate. 
This may be largely because the risk of culture -related 
artifacts was minimized by isolating FIV-NCSt^ genomic DNA 
5 from FIV- inoculated CD4+ lymphocytes (FCD4E cells) . The 
FCD4E cells used had been in laboratory culture for several 
years, but remained interleukin-2 dependent and appeared 
to express a normal rather than a transformed phenotype and 
thus represent as near as possible in vitro the primary in 
10 vivo target of FIV. 

The value of a molecular clone for studies of 
pathogenesis depends on its ability to replicate the 
disease caused by its biological parent virus. The NCSU X 
isolate of FIV causes an acute-stage clinical disease 
15 characterized by fever and lymphadenopathy that is 
transient and resolves as the infection progresses to the 
clinically asymptomatic stage of infection. The JSY3 
acute-stage infection was also characterized by a fever and 
lymphadenopathy that was followed by a clinically 
20 asymptomatic stage. 

Davidson et al. (Am. J. Pathol. 143:1486 (1993)) 
reported that cats infected with FIV-NCSU,. become highly 
susceptible to a normally avirulent strain of T. gondii as 
early as 18 weeks post-FIV infection. This dual FIV-T. 
25 gondii infection was utilized herein to determine if 
infection with clone JSY3 also caused an immunodeficiency 
early in the asymptomatic stage of infection; T. gondii 
infection of cats with prior JSY3 infection resulted in 
severe clinical infection as described below. 
30 The present observations indicate that the JSY3 

molecular clone causes a major impairment in the immune 
response, resulting in enhanced susceptibility to secondary 
infection by r. gondii. Thus, JSY3 possesses all /of the 
essential biological characteristics of the parent NCST^ 
35 isolate, including induction of immunodeficiency. 
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a Thft JSY3 Genome 

The DNA sequence of the JSY3 provirus clone of FIV-NCSU, 
is provided in Figure 1, with the group specific antigen 
(gag) , polymerase (pol) , and envelope protein (env) regions 
marked. The JSY3 proviral DNA sequence consists of 9471 

base pairs (SEQ ID NO:l). 

The coding region of gag is nucleotides 631-1980 of SEQ 
ID NO:l (SEQ ID NO: 4) and encodes a 450 amino acid product 

(SEQ ID NO: 2) . 

The coding region for the pl5 protein is nucleotides 
631-1035 of SEQ ID NO:l (SEQ ID NO:5), with a polypeptide 
product of 135 amino acids (SEQ ID NO:6) . 

The coding region for the p25 protein is nucleotides 
1036-1704 of SEQ ID NO:l (SEQ ID NO:7), with a polypeptide 
15 product of 223 amino acids (SEQ ID N0:8). 

The coding region for the p24a protein is nucleotides 
1264-1305 of SEQ ID NO:l (SEQ ID NO*: 9), with a polypeptide 
product of 14 amino acids (SEQ ID NO: 10). 

The coding region for the plO protein is nucleotides 
1717-1980 of SEQ ID NO:l (SEQ ID N0:11), with a polypeptide 
product of 88 amino acids (SEQ ID N0:12). 

The coding region of pol is amino acids 2151-5991 of 
SEQ ID NO:l (SEQ ID NO: 13) . Two open reading frames (orfs) 
are found in the pol region. Orf 1 is nucleotides 2151- 
5243 of SEQ ID NO:l (SEQ ID N0:14), encoding a product of 
1031 amino acids (SEQ ID NO:15) ; Orf 2 is nucleotides 5239- 
5991 of SEQ ID NO:l (SEQ ID NO: 16) and encodes a product of 
251 amino acids (SEQ ID N0:17) . 

The env coding region is ^nucleotides 6269-8824 of SEQ 
ID NO:l (SEQ ID NO: 18) and encodes a protein of 852 amino 
acids (SEQ ID N0:3). The transmembrane (TM) peptide is 
encoded by nucleotides 8339-8374 of SEQ ID NO:l (SEQ ID 
N0:19), and is 12 amino acids in length (SEQ ID NO:20) . 

Figure 2 aligns the gag open reading frames of the JSY3 
clone of NCSU, (FIV-NCSU) , FIV PPR, FIV Zl, FIV CG, FIV 14, 
FIV TM1, and FIV TM2 . Figure 3 aligns the whole envelope 
protein sequence of clone JSY3 of NCSU, with FIV 14, FIV Zl, 



20 



25 
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FIV CG, FIV 19k , and FIV PPR. 

Amino acid sequences disclosed herein are presented in 

the amino to carboxy direction, from left to right. The 

amino and carboxy groups are not presented in the sequence . 
5 Nucleotide sequences are presented herein by single strand 

only, in the 5' to 3' direction, from left to right. 

Nucleotides and amino acids are represented herein in the 

manner recommended by the IUPAC-IUB Biochemical 

Nomenclature Commission, or (for amino acids) by three 
10 letter code in accordance with 37 C.F.R. §1.822 and 

established usage. See, e.g. Patentln User Manual, 99-102 
(Nov. 1990) (U.S. Patent and Trademark Office, Office of 

the Assistant Commissioner for Patents, Washington D.C. 

20231); U.S. Patent No. 4,871,670 to Hudson et al . at Col. 
15 3, lines 20-43 (applicants specifically intend that the 

disclosure of this and all patent references cited herein 

are to be incorporated herein by reference) . 

Aspects of the present invention are achieved by a 

viral clone having the DNA sequence as provided herein for 
20 Feline Immunodeficiency Virus clone JSY3 . 

B. Identification of Antigenic Fragments 

Antigenic fragments of the present invention are 
peptides which contain at least one epitope (antibody 
binding site) which binds antibodies which bind to the FIV 

25 clone of the present invention. The antigenic fragments 
are preferably capable of inducing an immune response when 
administered to a feline subject, as discussed in greater 
detail below. In additioh, the antigenic fragments 
preferably bind antibodies which do not bind to prior FIV 

3 0 isolates. DNA encoding such antigenic fragments may be 
used to transform host cells to thereby produce such 
antigenic fragments, as explained in greater detail /below . 

Antigenic fragments may be identified by a variety of 
means. A protein from FIV clone JSY3 (such as the envelope 

35 protein, the gag open reading frame product, or a gag 
peptide such as plO, pl5, p24a or p25) may be fragmented 
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with a protease, and the fragments tested to determine 
whether or not a fragment reacts with antiserum against the 
protein. See, e.g., J. Robinson et al., Hoi. Cell Biochem. 
21 23-32 (1978). Another technique is to synthesize 
5 peptides which are fragments of the entire protein, and 
determine whether the individual fragments are recognized 
by neutralizing antibodies against the protein. See, e.g. , 
j Gerin et al., in Vaccines 95: Molecular and Chemical 
Basis of Resistance to Parasitic, Bacterial and Viral 
10 Diseases, 235-239 (Lerner et al . , eds. 1985). Still 
another method useful for obtaining immunogenic fragments 
of a protein is by isolation and identification of 
monoclonal escape mutants. In this strategy, FIV is 
produced in the presence of a monoclonal antibody to the 
15 virus. The only virus which can grow under these 
conditions are those with a mutation in the nucleotide 
sequence which codes for an epitope to which the monoclonal 
antibody binds. A mutant virus which grows under these 
conditions is referred to as the "monoclonal escape 
20 mutant." The monoclonal escape mutant is then sequenced 
and the mutant sequence compared with the nucleotide 
sequence of clone JSY3 to find the specific location of the 
mutation. The mutation is located in a region which codes 
for a protective epitope, or an "immunogenic fragment." 
25 See, e.g., J. Lopez et al . , Location of a Highly Conserved 
Neutralizing Epitope in the F Glycoprotein of Human 
Respiratory Syncytial Virus, J. Virol. 64, 927 (1990). 

r fanatic ^ gingering Techniques 

The production of DNA, vectors, transformed host cells, 
FIV virus, proteins, and protein fragments of the present 
invention by genetic engineering techniques can be carried 
out in accordance with methods known in the art.- See, 
e g U.S. Patent No. 4,761,371 to Bell et al . at Col. 6 
line 3 to Col. 9 line 65; U.S. Patent No. 4,877,729 to 
Clark et al, at Col. 4 line 38 to Col. 7 line 6; U.S. 
Patent No. 4,912,038 to Schilling at Col. 3 line 26 to Col. 
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14 line 12; and U.S. Patent No. 4,879,224 to Wallner at 
Col. 6 line 8 to Col. 8 line 59. 

Vectors are replicable DNA constructs used to either 
amplify or express DNA of the present invention. An 
expression vector is a replicable DNA construct in which 
DNA of the present invention is operably linked to control 
sequences capable of expressing that DNA in a suitable 
host. Generally, control sequences include a 

transcriptional promoter, an optional operator sequence to 
control transcription, a sequence encoding suitable mRNA 
ribosomal binding sites, and sequences which control the 
termination of transcription and translation. Suitable 
vectors include plasmids, viruses (e.g., vaccinia virus, 
adenovirus, baculovirus, cytomegalovirus), phage, and 
integratable DNA fragments (i.e., fragments integratable 
into the host genome by recombination) . 

DNA regions are operably linked or operably associated 
when they are functionally related to each other. For 
example, a promoter is operably linked to a coding sequence 
if it controls the transcription of the sequence; or a 
ribosome binding site is operably linked to a coding 
sequence if it is positioned so as to permit translation. 

Transformed host cells are cells which have been 
transformed or transfected with vectors as described above. 
Transformed host cells ordinarily express the DNA of the 
present invention. As used herein, host cells containing 
the FIV clone JSY3 refer to isolated cells (or cultures of 
such cells) naturally infected with JSY3, including cells 
containing the JSY3 proviral *DNA integrated into cellular 
DNA. Suitable host cells include prokaryote, yeast or 
higher eukaryotic cells such as mammalian cells and insect 
cells . 

Prokaryote host cells include gram negative or gram 
positive organisms, for example Escherichia coli (E. coli) 
or Bacilli. Exemplary host cells are E . coli W3110 (ATCC 
27,325), E. coli B, E. coli X1776 (ATCC 31,537), E. coli 
294 (ATCC 31,446) . A broad variety of suitable prokaryotic 



WO 98/39451 PCT/US98/04147 



10 



20 



25 



35 



-13- 



and microbial vectors are available. E. coli is typically 
transformed using P BR322. Promoters most commonly used in 
recombinant microbial expression vectors include the fi- 
lactamase (penicillinase) and lactose promoter systems 
(Chang et al . , Nature 275:615 (1978); and Goeddel et al., 
Nature 281:544 (1979)), a tryptophan (trp) promoter system 
(Goeddel et al., Nucleic Acids Res. 8:4057 (1980) and EPO 
App Publ. No. 36,776) and the tac promoter (H. De Boer et 
al., Proc. Natl. Acad. Sci. USA 80:21 (1983)). The 
promoter and Shine-Dalgarno sequence are operably linked to 
the DNA of the invention, i.e., they are positioned so as 
to promote transcription of messenger RNA from the DNA. 

Eukaryotic microbes such as yeast cultures may also be 
transformed with vectors of the present invention. See, 
15 eg., U.S. Patent No. 4,745,057. Sac char omyces cerevisiae 
is the most commonly used yeast, although other yeast may 
also be used. Yeast vectors may contain an origin of 
replication from the 2 micron yeast plasmid or an 
autonomously replicating sequence (ARS) , a promoter, a JSY3 
coding region, sequences for polyadenylation and 
transcription termination, and a selection gene. An 
exemplary plasmid is YRp7, (Stinchcomb et al., Nature 
282:39 (1979); Kingsman et al . , Gene 7:141 (1979); 
Tschemper et al . , Gene 10:157 (1980) ) . Suitable promoting 
sequences in yeast vectors include the promoters for 
metallothionein, 3-phosphoglycerate kinase (Hitzeman et 
al J Biol. Chem. 255:2073 (1980) or other glycolytic 
enzymes (Hess et al., J. Adv. Enzyme Reg. 7:149 (1968); and 
Holland et al . , Biochemistry 17 : 4900 (1978)). 
30 Host cells such as insect cells (e.g., cultured 

Spodoptera frugiperda cells) and expression vectors such as 
the baculovirus expression vector (e.g., vectors derived 
from Autographa californica MNPV, Trichoplusia ni MNPV, 
Eachiplusia ou MNPV, or Galleria ou MNPV) may be employed 
in carrying out the present invention, as described in U.S. 
Patents Nos. 4,745,051 and 4,879,236 to Smith et al. In 
general, a baculovirus expression vector comprises a 
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baculovirus genome containing the gene or coding region to 
be expressed inserted into the polyhedrin gene at a 
position ranging from the polyhedrin transcriptional start 
signal to the ATG start site and under the transcriptional 
5 control of a baculovirus polyhedrin promoter. 

Examples of useful mammalian host cell lines are VERO 
and HeLa cells, Chinese hamster ovary (CHO) cell lines, and 
WI13 8, BHK, COS-7, CV, and MDCK cell lines. The 
transcriptional and translational control sequences in 
10 expression vectors to be used in transforming vertebrate 
cells are often provided by viral sources. For example, 
commonly used promoters are . derived from polyoma, 
Adenovirus 2, and Simian Virus 40 (SV40) . See, e.g., U.S. 
Patent No. 4,599,308. An origin of replication may be 
15 provided either by construction of the vector to include an 
exogenous origin, such as may be derived from SV4 0 or other 
viral (e.g. Polyoma, Adenovirus, VSV, or BPV) source, or 
may be provided by the host cell chromosomal replication 
mechanism. If the vector is integrated into the host cell 
20 chromosome, the latter is often sufficient. Rather than 
using vectors which contain viral origins of replication, 
one can transform mammalian cells by the method of 
cotransf ormation with a selectable marker and DNA of the 
present invention, as described in U.S. Pat. No. 4,399,216. 
25 Alternatively, the invention DNA sequences can be 

translated into RNA, which can then be transfected into 
amphibian cells for transcription into protein. Suitable 
amphibian cells include Xenopus oocytes. 

Use of the phrase "substantial sequence similarity" in 
3 0 the present specification and claims means that DNA, RNA or 
amino acid sequences which have slight and non- 
consequential sequence variations from the actual sequences 
disclosed and claimed herein are considered to be 
equivalent to the sequences of the present invention. In 
3 5 this regard, "slight and non-consequential sequence 
variations" mean that "similar" sequences (i.e., the 
sequences that have substantial sequence similarity with 
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the DNA, RNA, or proteins disclosed and claimed herein) 
will be functionally equivalent to the sequences disclosed 
and claimed in the present invention, 
equivalent sequences will function in substantially the 
s same manner to produce substantially the same co.posrt.ons 
a s the nucleic acid and amino acid compositions drsclosed 

and claimed herein. 

AS used herein, the tern 'gene' refers to a D„A 
sequence that incorporates (1) upstream (5M regulatory 
10 signals including the promoter, (2, a coding region 
specifying the product, protein or RNA of the gene, 3) 
downstream <3', regions including.transcription termination 
and polyadenylation signals and .4) associated sequences 
required for efficient and specific expression. 

The term 'promoter' refers to a region of 
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sequence that incorporates the necessary signals for the 
sequence codi ng sequence. THis may 

efficient expression of a coding »=m 

include sequences to which an Ha polymerase binds but is 
not limited to such sequences and may include regions to 
Tien other regulatory proteins bind together with regions 
Evolved in the control of protein translation and may 
include coding sequences. 

„ v.nMnea -* Formulation - ^ 

^IhTlresent invention provides for a variety of 
different vaccines useful for protecting feline , species 
against FIV. Examples include live attenuated clone JSY3 
tirs fixed whole virus, host cells which express virus 
a u en on the surface there? (with the celU , ; ptiona y 
fixed) , preparations of virus fragments, purified P™" 1 ^' 
antTgenic fragments of proteins, and antigenic peptides 
which are derivatives of the antigenic fragments (as 

„ • rtwail below) . These various compounds and 

discussed in detail neiowi 

fixtures are generally referred to herein as active 

a9en L Ive attenuated FIV clone JS Y3 virus is made by serial 
passage of the virus in tissue culture or genetically 
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altered by recombinant techniques, in accordance with known 
procedures. Fixed virus is made by contacting live virus 
(attenuated or unattenuated) to a suitable fixative, such 
as formalin. 

5 Preparations of viral fragments are made by lysing host 

cells, such as E. coli cells, transformed with a vector 
encoding the FIV of the present invention or a portion 
thereof. For example, the vector may encode a JSY3 DNA 
segment which produces hollow virus particles which are 

0 antigenic. The lysate may be used in crude form, partially 
purified, or a particular viral protein (or antigenic 
fragment thereof) such as the envelope protein purified to 
homogeneity, and used as an active agent for a vaccine 
against FIV. 

.5 Host cells such as yeast cells may be transformed with 

vectors of the present invention capable of expressing JSY3 
proteins, or antigenic fragments thereof, on the surface of 
the host cells, and the transformed host cells used as an 
active vaccine agent per se or fixed (e.g., with formalin) 
0 and used as an active agent. 

Antigenic peptides are selected from the group 
consisting of antigenic fragments of FIV clone JSY3 
proteins, such as the envelope protein, the gag open 
reading frame product, and gag peptides (such as plO, pl5, 
25 p24a, p25) and the antigenic equivalents thereof (i.e., 
analogs or derivatives) . Antigenic peptides may be 
chemically synthesized" or produced by recombinant 
techniques. The antigenic fragments are preferably not 
more than 2 0 amino acid residues in length, and are more 
30 preferably not more than 10 amino acid residues in length. 
The antigenic equivalents are selected from the group 
consisting of: (a) modified peptides comprising the 
aforesaid antigenic fragments modified by the inclusion of 
one or more changes to the amino acid sequence thereof; 
35 and (b) longer peptides which incorporate the sequence of 
the aforesaid fragments or the aforesaid modified peptides 
and which have (i) up to four extra amino acid residues 
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attached to the C-terminal end thereof, (ii) up to four 
extra amino acid residues attached to the N-terminal end 
thereof, or (iii) up to four extra amino acid residues 
attached to the C-tertninal end thereof and up to four extra 
amino acid residues attached to the N-terminal end thereof. 

The term "antigenic equivalents , " as used herein, 
refers to proteins or peptides which bind to an antibody 
which binds to the protein or peptide with which 
equivalency is sought to be established. Antibodies which 
are used to select such antigenic equivalents are referred 
to as "selection antibodies" herein. Preferred selection 
antibodies are monoclonal antibodies which bind to clone 
JSY3 , but preferably not to isolates of FIV other than FIV 
strain NCSU, (such as the Petaluma strain isolated by N. 
Pedersen), and most preferably not to other molecular 

clones of FIV NCSU^ 

One or more amino acids of an antigenic peptide 
sequence may be replaced by one or more other amino acids 
which does not affect the antigenicity of that sequence. 
Such changes can be guided by known similarities between 
amino acids in physical features such as charge densxty, 
hydrophobicity/hydrophilicity, size and configuration. For 
example, Thr may be replaced by Ser and vice versa, Asp may 
be Replaced by Glu and vice versa, and Leu may be replaced 
by He and vice versa. 

Antigenic equivalents may be formed by modifying 
reactive groups within a natural sequence or modifying the 
N-terminal amino and/or C-terminal carboxyl group. Such 
equivalents include salts formed with acids and/or bases, 
particularly physiologically acceptable inorganic and 
organic acids and bases. Other equivalents include 
modified carboxyl and/or amino groups on the synthetic 
peptide to produce esters or amides, or amino acid 
protecting groups such as N-t-butoxycarbonyl . Preferred 
modifications are those which provide a more stable, active 
peptide which will be less prone to enzymatic degradation 
in vivo. 
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For use as a vaccine, the active agents of the present 
invention may be administered to the subject by any 
suitable means. Exemplary are by intramuscular injection, 
by subcutaneous injection, by intravenous injection, by 
5 intraperitoneal injection, by oral injection, and by nasal 
spray . 

The amount of active agent administered will depend 
upon factors such as route of administration, species, and 
the use of booster administrations. In general, a dosage 
10 of about .1 to about 100 /xg per pound subject body weight 
may be used, more particularly about l /xg per pound. 

Vaccine formulations of the present invention comprise 
the active agent in a pharmaceutical^ acceptable carrier. 
The active agent is included in the carrier in an amount 
15 effective to protect the subject being treated. 
Pharmaceutically acceptable carriers are preferably liquid, 
particularly aqueous, carriers, such as sodium phosphate 
buffered saline. The vaccine formulation may be stored in 
a sterile glass container sealed with a rubber stopper 
20 through which liquids may be injected and formulations 
withdrawn by syringe. 

Vaccine formulations of the present invention may 
optionally contain one or more adjuvants. Any suitable 
adjuvant can be used, exemplary being aluminum hydroxide, 
25 aluminum phosphate, plant and animal oils, synthetic 
polymers and the like, with the amount of adjuvant 
depending on the nature of the particular adjuvant 
employed. In addition, the vaccine formulations may also 
contain one or more stabilizer, exemplary being 
3 0 carbohydrates such as sorbitol, mannitol, starch, sucrose, 
dextrin, and glucose, proteins such as albumin or casein, 
and buffers such as alkaline metal phosphates and the like. 

E. Infection of Cats with FIV clo ne JSY3 . 

Cats infected with FIV clone JSY3 are useful as a model 
35 system for the study of retroviral infections, such as by 
HIV. Cats used for this purpose are preferably specific 
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pathogen-free (SPF) cats, which are commercially available 

Lorn sources such as Charles River Laboratories and 

u- TWatories infected cats are preferably 
Berkshire Laboratories. 

maintained as a single colony of two or more cats all 
Infected with FIV clone JSY3 • The colony may be 
in a single room with each cat housed in an 
age in accordance with standard practices for the 

9 . • -, 0 jv colony will consist of a 

maintenance of animals. col °^ 

plurality of infected cats, typically from ten, fifteen, 
t enty Lrty or more cats; the number of individual cats 
Til vary according to need. Preferably, all members of 
1 cololy are SPF cats (i.e., free of pathogens other than 

FIV clone JSY3) . jsy3 by any 

SPF cats may be infected wiui 
suitable means, such as by intraperitoneal, ^ravenous. « 
subcutaneous injection with a solution containing FIV clone 
^3 The solution may be blood from a previously mfe ted 
Z a blood fraction containing peripheral biood 
it from a previously infected cat, a 

useful as a model system for immunodef icient states 
associated with retroviral infection because of the rapid 

f CD4"-CD8* ratio caused by JSY3. When used 
inversion of the CD I .CDS ^ ^ 

, model system, the cat or cats into 

JS Y3 is subjected to a treatment, which treatment is a 

jsyj . m K a n nei retroviral infections, and 

candidate f or use , in combing ret ^ ^ ^ 

^ clttl group of' cats infected with FIV clone 
Z but untreated, or placebo treated, may be included as 
a control group. A slowing in the progression of the 
disease in the cats indicates that the treatment may be 
u eful for combating retroviral diseases in other anima 
35 71 ects. Typically, the candidate treatment will hen be 
" Z cted to further screening procedures and t-cologica 
testing to determine whether the treatment may be 
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clinically useful. The treatment to which the cats are 
subjected may be any treatment, such as the administration 
of candidate drugs (e.g., candidate antiretroviral 
compounds) or drug combinations, including small organic 
compounds, peptides, or proteins, which may be administered 
orally or parenterally, or may involve treatments other 
than the administration of drugs such as a biological 
response modifier or a vaccine. The progress of the 
disease in the cats after treatment can be monitored by any 
suitable means, such as examination for inhibition of the 
deterioration of CD4* cell levels, declines in the 
circulating levels of the FIV GAG protein, the weight of 
the cat and its general appearance, etc. 

An advantage of using JSY3 infected cats as a model for 
retroviral disease as described above is that the FIV virus 
is not infectious to humans. A disadvantage of this model 
is that cats are somewhat large animals; mice are much more 
practical as animal models of disease. 

An additional aspect of the present invention is an 
immunodef icient mouse containing feline tissue, which 
feline tissue is capable of infection with feline 
immunodeficiency virus (FIV) . The mouse is infected with 
FIV clone JSY3 , and used as an animal model in essentially 
the same manner as cats as described above. Any suitable 
5 immunodef icient mouse may be employed, such as SCID mice 
(e.g., the C.B.-17 scid/scid mouse) athymic mice such as 
the nude mouse, and mice which have been rendered 
immunodef icient by treatment with radiation. The mouse may 
be deficient in T lymphocytes function alone (e.g., athymic 
0 mice) , but is preferably deficient in both T and B 
lymphocyte function . 

The feline tissue which the immunodef icient mice 
contains preferably comprises one or more of the following: 
feline thymus tissue, feline lymph node tissue, feline 
5 liver cells, feline bone marrow cells, feline peripheral 
blood mononuclear cells such as peripheral blood 
lymphocytes and peripheral blood monocytes, and feline 
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spleen cells. The feline tissue may be introduced into the 
mouse by any suitable means, such as intraperitoneal 
injection, intravenous injection, surgical implantation, 
and combinations thereof. Feline tissue may be introduced 
as organized tissues (e.g., thymus and lymph node) or as 
discrete cells. One example is an immunodef icient mouse 
having feline thymus tissue and/or lymph node tissue 
surgically implanted. Another example is an 

immunodeficient mouse into which peripheral blood 
mononuclear cells have been intraperitoneal^ injected. 

v maanoF t-i<- Probes. 

The FIV clone JSY3 nucleotide sequence can be used to 
generate hybridization probes which specifically bind to 
FIV clone JSY3 genetic material, or the genetic material of 
FIV clones having all of, or essentially all of, the 
identifying characteristics of FIV clone JSY3 , to determine 
the presence of such FIV in cats. The hybridization probe 
m ay be selected so that it does not bind to known FIV 
isolates (such as the Petaluma strain) other than NCSU,, or 
to any FIV isolate or clone other than JSY3 . Hybridization 
probes may be cDNA fragments or oligonucleotides, and may 
be labelled with a detectable group as discussed 
hereinbelow. Pairs of probes which will serve as PCR 
primers for the JSY3 genome or a portion thereof may be 
used in accordance with the process described in U.S. 
Patents Nos. 4,683,202 and 4,683,195. 

For example, an illustrative embodiment of the above 
probes comprises DNA sequence^ set forth in SEQ ID NOS:4, 
5, 7, 9, 11, 13, 14, 16, 18, and 19, or suitable fragments 

30 thereof. ^ , 

The term "labelled" is used herein to refer to the 
conjugating or covalent bonding of any suitable detectable 
group, including enzymes (e.g., horseradish peroxidase ff- 
alucuronidase, alkaline phosphatase, and 

35 galactosidase, , fluorescent labels (e.g., fluoresced 
luciferase), and radiolabels (e.g., "C '»!, *. "P, and 
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35 S) to the compound being labelled. Techniques for 
labelling various compounds, including proteins, peptides, 
and antibodies, are well known. See, e.g., Morrison, 
Methods in Enzymology 32b, 103 (1974); Syvanen et al., J. 
Biol. Chem. 284, 3762 (1973); Bolton and Hunter, Biochem. 
J. 133, 529 (1973) . 



G. DNA Sequence and Genome Organ ization 

Isolated DNA from the JSY3 provirus may be used to 
generate hybridization probes, which may be used in 
diagnostic assays as discussed above. Isolated DNA capable 
of expressing antigenic proteins or antigenic fragments 
thereof may be used for producing proteins which are also 
useful in diagnostic assays. 

An aspect of the present invention is oligonucleotide 
probes which selectively hybridize to DNA encoding a group 
antigen (gag) polypeptide (or an antigenic fragment 
thereof) of FIV clone JSY3 under stringent conditions, 
which probes do not bind to DNA encoding the group antigen 
(gag) polypeptide of the following known FIV strains under 
the same stringency conditions: FIV-Petaluma (U.S. Patent 
No. 5,037,753); FIV-PPR (Phillips et al . , J. Virology, 
64:4605 (1990)); FIV-TM1 and FIV-TM2 (Miyazawa et al., 
Arch. Virology 108:59 (1989)); FIV-UT113 (Verschoor et al., 
J. Cell. Biochem., Suppl . 14D:143 (1990). Conditions which 
will permit other DNA coding for an FIV gag polypeptide to 
hybridize to the DNA of FIV clone JSY3 gag. polypeptide can 
be determined in a routine manner. For example, 

hybridization may be carried out under conditions of 
reduced stringency or even stringent conditions (e.g., 
conditions represented by a wash stringency of 0.3M NaCl, 
0.03M sodium citrate, and 0.1% SDS at 60°C or even 70° C) 
to DNA encoding the gag polypeptide of FIV clone JSY3 
disclosed herein in a standard in situ hybridization assay. 
See J. Sambrook et al., Molecular Cloning, A Laboratory 
Manual (2nd Ed. 1989) (Cold Spring Harbor Laboratory)) . 

In general, DNA which codes for FIV gag polypeptide or 
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antigenic fragments thereof and which hybridizes to DNA 
encoding gag polypeptide (or antigenic fragments thereof) 
of FIV clone JSY3 disclosed herein will have at least 75«, 
80% 85%, or even 90% or more sequence similarity with the 
DNA of the gag polypeptide (or antigenic fragments thereof) 
of FIV clone JSY3 disclosed herein. Further, DNA which 
codes for FIV gag polypeptide (or antigenic fragments 
thereof) , or which codes for a gag polypeptide or antigenic 
fragment coded for by DNA which hybridizes to the DNA which 
codes for FIV clone JSY3 gag polypeptide or antigenic 
fragment thereof, but which differ in codon sequence from 
these due to the degeneracy of the genetic code, are also 
an aspect of this invention. The degeneracy of the genetic 
code, which allows different nucleic acid sequences to code 
for the same protein or peptide, is well known m the 
Uterature. See, 0.8. Patent No. 4,757,006 to Toole 

et al. at Col. 2, Table 1. 

A particular embodiment of the foregoing also disclosed 
herein i. isolated DNA encoding the group antigen (aaa) 
2 0 polypeptide or an antigenic fragment thereof, of ,1V clone 
Lf and isolated DNA encoding the envelope protein or an 
antigenic fragment thereof, where the DNA is: <-> isolated 
DNA encoding group antigen £T£L 
protein, or an antigenic fragment thereof ° 
2 , JSY3 (b) isolated DNA which hybridizes to isolated DNA of 
Z above under stringent conditions and which encodes a 
(eline immunodeficiency virus group antigen (3S3 
polypeptide, envelope protein, or antigenic fragment 
fhefeof with at least 75*. N>*. ™ « even 90% or more 
30 seguence similarity to isolated DNA of (a, above; or ,c, 
isolated DNA differing from the isolated DNAs of (a) and 
" above in nucleotide seance due to the degeneracy of 
h genetic code, and which encodes . -feline 
immunodeficiency virus group antigen P<*»£^ 
35 envelope protein, or antigenic fragment thereof encoded by 
the isolated DNAs of (a) or (bl , above. 

An illustrative embodiment of the foregoing DNA which 
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codes for FIV clone JSY3 gag polypeptide (or antigenic 
fragments thereof) is DNA according to SEQ ID NO: 4 or a 
portion thereof; DNA according to SEQ ID NO : 5 (pl5) or a 
portion thereof; DNA according to SEQ ID NO: 7 (p25) or a 
5 portion thereof; DNA according to SEQ ID NO: 9 (p24a) or a 
portion thereof; DNA according to SEQ ID NO: 11 (plO) or a 
portion thereof. An illustrative embodiment of the 
foregoing DNA which codes for FIV clone JSY3 envelope 
protein (or antigenic fragments thereof) is SEQ ID NO: 18 or 
10 SEQ ID NO: 19. Also disclosed herein are recombinant DNA 
sequences comprising vector DNA and a DNA encoding group 
specific antigen (gag) polypeptides of clone JSY3 , or the 
envelope protein of JSY3 , or antigenic fragments thereof 
(as given above) . 
15 The FIV provirus includes the structural genes for 

group-specific antigens (gag gene) , envelope proteins (env 
gene) arid reverse transcriptase ( pol gene) , as well as 
several short open reading frames similar to those of other 
lentiviruses . Omsted et al . , Proc. Natl. Acad. Sci. USA, 
20 86, 2448 (1989); Olmsted et al . , Proc. Natl. Acad. Sci. 

USA, 86, 8088 (1989). The gag gene of FIV has been 
reported to encode a polyprotein of about 450 amino acids, 
which is subjected to postranslational cleavage. Talbot et 
al., Proc. Natl. Acad. Sci. USA, 86, 5743 (1989); Phillips 
25 et al., J. Virology, 64, 4605 (1990) . The gag gene and its 
predicted protein product has been reported to be highly 
conserved among isolates of FIV. Phillips et al., J. 
Virology, 64, 4605 (1990); Morikawa et al., Virology, 183, 
288 (1991) . FIV gacr gene has been expressed in baculovirus 
30 vectors and assembled into virus-like particles. Morikawa 
et al., Virology, 183, 288 (1991). 

Isolated and purified FIV clone JSY3 group antigen 
( qag ) polypeptide, envelope protein, or antigenic fragments 
thereof are also an aspect of the present invention. These 
35 polypeptides or fragments are coded for by: (a) isolated 
DNA which encodes group antigen (gag) polypeptide or 
envelope protein, or an antigenic fragment thereof, of FIV 
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clone JSY3; (b) isolated DNA which hybridizes to isolated 
DNA of (a) above under stringent conditions and which 
encodes a PIV gjs polypeptide, envelope protein, or 
antigenic fragment thereof with at least 75% sequence 
5 similarity to isolated DNA of (a) above; or (c) isolated 
DNA differing from the isolated DNAs of (a) and (b) above 
in nucleotide sequence due to the degeneracy of the genetic 
code, and which encodes a FIV gag. polypeptide, envelope 
protein, or antigenic fragment thereof encoded by DNAs of 
10 (a) or (b), above. By antigenic polypeptide is meant a 
polypeptide which is able to raise (with the aid of an 
adjuvant if necessary) an antibody response in cats. The 
polypeptide may be a fragment of a polypeptide naturally 
occurring in FIV particles. The fragment may be from a 
15 naturally occurring polypeptide or produced by isolation or 
synthesis of a gene or coding region encoding a desired 
polypeptide and expression within an appropriate expression 
system . 

An illustrative embodiment of the foregoing 
20 polypeptides is the JSY3 group antigen specific polypeptide 
(SEQ ID N0:2) and peptides thereof (SEQ ID NO:6 (pl5) ; SEQ 
ID N0:8 ( P 25); SEQ ID NO:10 ( P 24a) ; SEQ ID NO:12 (plO) ) ; 
and the JSY3 envelope protein (SEQ ID NO: 3) and TM protein 
(SEQ ID NO: 19) . 

25 The present invention is explained in greater detail in 

the non-limiting Examples set forth below. 

EXAMPLE 1 

Mai-prials and Methods 
Viruses. The biological parent virus isolate FIV-NCSU, 
30 (US Patent No. 5,413,927 to Tompkins et al . ) was obtained 
from the peripheral blood mononuclear cells (PBMCs) of a 
cat naturally infected with FIV and has been described 
elsewhere (Davidson et al . , Am. J. Pathol. 143:1486 (1993); 
English et al., J • Virol. 67:5175 (1993); English et al . , 
35 J infect. Dis. 170:543 (1994); Tompkins et al . , J. Am. 
Vet. Med. Assoc. 199:1311 (1991)). The NCSU, isolate (or 
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"NCSU-l") is available from the American Type Culture 
Collection (ATCC Number VR2333) , 12301 Parklawn Drive, 
Rockville, Maryland 20852 USA (deposited in accordance with 
the provisions of the Budapest Treaty, July 23, 1991) . See 
5 U.S. Patent 5,413,927 to Tompkins et al . The FIV-NCS^ 
molecular clone JSY3 inoculum was collected from an FCD4E 
feline lymphocyte culture which had been cocultured with 
transfected Crandell feline kidney (CrFK) cells (see 
below) . 

10 Molecular cloning of the FIV proviral genome. Genomic 

DNA was isolated by equilibrium centrif ugation in CsCl- 
ethidium bromide gradients {Maniatis et al. , Molecular 
cloning: A laboratory manual, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY) from 5 x 10 7 FCD4E cells 
15 (interleukin- 2 -dependent, FIV-NCSUi- infected feline CD4 + 
lymphocytes) inoculated with FIV-NCSUi obtained from the 
original source cat . - As shown in Figure 4 , FCD4E genomic 
DNA which had been partially digested with Sau3AI and size 
fractionated was cloned into the EMBL3 lambda vector arm. 
20 Genomic libraries were screened primarily by plaque 
hybridization with a gag region PCR product probe (838 bp) 
as described elsewhere (Maniatis et al., Molecular cloning: 
A laboratory manual, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY) . A full-length clone was identified by 
25 PCR of phage suspension with six primer sets designed from 
FIV-14 sequences (GenBank accession no. M25381) . These 
primer sets amplified 5' long terminal repeat, gag, env, 
and 3' long terminal repeat regions under the PCR 
conditions described below. ? The following primers were 
30 used for identification of the full-length lambda clone 
JSY3 (each primer designated by the 5' nucleotide of the 
complete FIV-14 sequence) : 3U (U3) 5'-GGA TGA GTA TTG GAA 
CCC TGA A-3' (SEQ ID NO:21); 337L (U5) 5' -GAT TCC GAG ACC 
TCA CAG GTA A-3' (SEQ ID NO: 22); 447U 5'-AAT AGG GAA GCA 
35 GTA GCA GAC-3' (SEQ ID NO:23) ; 829L 5' -GTA AAT CGC AAA TAA 
CCA ACC-3' (SEQ ID NO-.24); 919U (FIV7) 5' -TGA CGG TGT CTA 
CTG CTG CT-3' (SEQ ID NO:25); 175GL (FIV8) 5'-CAC ACT GGT 
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CCT GAT OCT TTT-3 ' (SEQ ID NO:26) ; 1057U 5' -CCA CAA TAT GTA 
GCA CTT GAG C-3' (SEQ ID NO:27); 1639L S'-GGG TAG TTT CTG 
OCT TAA GGT Q-3' (SEQ ID NO:28); 6938U 5 ' -GGG GGA CCT ACC 
TTG GGG AAT TGG GCT-3' (SEQ ID NO:29) ; 7252L 5' -GGT GAT CAT 
5 GAT CAG TGG GAT TTG TAA TGG GTC TG-3' (SEQ ID NO:30) ; 7252L 
5- -GGT GAT CAT GAT CAG TGG GAT TTG TAA TGG GTC TG-3' (SEQ 
ID NO-31) ; 8859U 5 > -ATA AGG GAG ATA CTG TGC TGA-3 ' (SEQ ID 
NO: 32) ; 9029L 5'- GCG ATC TTC TAA CTC TGT CAT-3' (SEQ ID 
NO: 33) . 

10 DNA transfection. Ten micrograms of lambda clone DNA 

was transfected into CrFK and AH927 (a feline embryonic 
fibroblast cell line) cells by using the cationic liposome 
DOTAP (Boehringer Mannheim, Indianapolis, Ind.) according 
to the manufacturer's protocol. Twenty- four hours after 
transfection, these cells were cocultured for 72 hours with 
FCD4E or concanavalin A (10 /xg/ml) -stimulated normal cat 
PBMCs. FCD4E (or PBMCs) and CrFK (or AH927) cells were 
then cultured separately. Culture supernatant was 
collected at 3- to 4- day intervals and assayed for RT 
activity. Pooled samples for in vivo infection were 
titrated in FCD4E cells by the 50% tissue culture infective 

dose (TCID S0 ) method. 

In vitro infections with JSY3 clone. Cultures of FCD4E 
or DEAE-dextran-treated CrFK cells were inoculated with 
2S cell- free FIV-NCSU, JSY3 clone containing 2 x 10< cpm of RT 
activity. The culture supernatant was collected twice 
weekly and assayed for RT activity. 

in vivo FIV infection. Six 6-month old female cats were 
inoculated intravenously with,10« TCID so s of the JSY3 clone. 
30 Nine age- and sex-matched specific-pathogen-fee (SPF) cats 
were inoculated with wild-type FIV-NCSU, , and nine mock- 
infected SPF cats were used as controls. The wild-type 
FIV-NCSU, infected group was examined up to 18 weeks post 
infection (p.i.) in parallel with the JSY3- infected cats. 
35 Blood sampling. Whole blood was collected by jugular 

venipuncture into sodium citrate anticoagulant tubes. 
Aliquots were removed for complete blood counts and flow 
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cytometry, and plasma was collected for anti-FIV antibody 
assays. PBMCs were purified over Percoll as described 
(Tompkins et al . , Vet. Immunol . Immunopa thol . , 16:1 
(1987)). PBMCs were then cocultured with FCD4E cells for 
5 infectious virus recovery, lysed for provirus detection by 
PCR, or sorted for lymphocyte subset tropism studies. 

Lymphocyte subset analysis by flow cytometry. 
Lymphocyte subsets were determined by two-color flow 
cytometric analysis as described (Davidson et al M Am. J. 
10 Pathol. 143:1486 (1993)) using a panel of monoclonal 
antibodies (MAbs) (Tompkins et al. , Vet. immunol . 
Inununopathol . 26:305 (1990)). Briefly, plasma was removed, 
the cells were washed twice in phosphate-buffered saline 
(PBS) , and MAbs were added in a combination of fluorescein 
15 isothiocyanate- labeled ant i -cat immunoglobulin (Kirkegaard 
& Perry Laboratories, Inc., Gaithersburg, MD) and biotin- 
labeled ant i -pan T-cell antibodies or fluorescein 
isothiocyanate-labeled anti-CD8 and biotin-labeled anti-CD4 
antibodies. Biotin-labeled antibodies were developed with 
2 0 phycoerythrin. Erythrocytes were lysed with fluorescence- 
activated cell sorter (FACS) lysing solution (Becton 
Dickinson I mmuno cytometry Systems, San Jose, CA) , and the 
percent positively stained lymphocytes was determined by 
flow cytometric analysis using a Becton Dickinson FACScan. 
25 The absolute numbers for each lymphocyte subset were 
calculated by multiplying the percent positive cells by the 
total number of lymphocytes, determined by a complete blood 
count and differential performed on the blood sample. 

PCR-Southern blot analysip for FIV-provirus detection. 
3 0 Percoll -purified PBMCs were washed with PBS, and cell 
pellets were stored at -70°C until assayed. Cells (10 6 ) 
were lysed in 200 jil of 1 x PCR buffer and digested with 
600 fig of proteinase K per ml. An 83 8 -bp length of the FIV 
gag region was amplified with the primer set 919U-1756L. 
3 5 Amplification was performed as described previously 
(English et al . , J. Virol. 67:5175 (1993)), with minor 
modifications. Briefly, 2 fil of cell lysate (equivalent to 
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10* cells) was amplified in a 100-fil PCR mixture (1 x PCR 
buffer 1.5 mM MgCl 2 , 200 iM each deoxynucleoside 
triphosphate, 0 . 5 pM each primer, and 2.5 U of Tag DNA 
polymerase over 40 cycles (one cycle was 94°C for 1 minute, 
59 o C for 2 minutes, and 72°C for 1 minute, final extension 
was done at 72°C for 10 minutes) . Amplified products were 
resolved on a 1.2% agarose gel, blotted, and hybridized 
with radiolabeled internal oligonucleotides probe. 

Western blot analysis for plasma antibody to FIV. The 
Western blot (immunoblot) assay was performed as described 
(Novotney et al., AIDS 4:1213 (1990) ) . 

RT activity assay. The Mg 2 * -dependent RT activxty 
assay was performed as described (Novotney et al . , AIDS 
4-1213 (1990)) and is a modification of a procedure of Goff 
et al. (Goff et al., J. Virol. 38:239 (1981)). 

Lymphocyte subset sorting of feline PBMCs. The JSY3 
^lone-infected cat PBMCs were sorted into CD4 + , CD8 + and B 
lymphocyte subsets using MiniMACS (Miltenyi Biotec, 
Sunnyvale, CA) magnetic beads. Percoll- enriched PBMCs were 
divided among three tubes and incubated at 4°C for 30 
minutes with biotin- labeled anti-CD4 or anti-CD8 or antx- 
canine B-cell MAb (B5) for a non-immunoglobulin-posxtxve B- 
cell epitope (English et al., J. Virol. 67:5175 (1993)) 
Streptavidin-conjugated MiniMACS beads were then added, and 
the cells were incubated for an additional 20 minutes at 
4°C and then positively sorted. A fraction of each sorted 
subset was analyzed for purity by two-color flow cytometry. 
Cells were stained with biotin- labeled MAbs, developed with 
phycoerythrin-conjugated streptavidin, and analyzed on the 
FACScan. The remaining sorted lymphocytes were stored at - 
70°C until they were assayed for the presence of FIV 
provirus by PCR-Southern blotting. 

T gondii infection. Twenty-nine weeks after infectxon 
with the JSY3 clone, cats were inoculated via the carotxd 
artery with 10,000 tachyzoites of the ME49 strain of T. 
gondii as described (Davidson et al., Am. J. Pathol. 
14 3:1486 (1993)). Six age- and sex-matched SPF cats were 
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also inoculated with T. gondii as controls. The cats were 
examined daily for clinical signs of illness using scoring 
criteria (Davidson et al . , Am. J. Pathol. 143:1486 (1993)). 
Cats with severe clinical signs indicative of generalized 
5 toxoplasmosis were euthanized by barbiturate overdose. 

Postmortem examination. Following euthanasia, a gross 
necropsy was performed and tissues were sampled for 
microscopic examination. Tissues were fixed in 10% neutral 
buffered formalin, embedded in paraffin, sectioned and 
10 stained with hematoxylin and eosin stain. 



EXAMPLE . 2 

Molecular Cloning and Sequencing 
of the J5Y3 Proviral Genome 
A total of 5 x 10 7 FCD4E cells were infected with wild- 
15 type FIV-NCSUi from the FIV-NCSU,. source cat. Genomic DNA 
from this culture was cloned into the EMBL3 lambda vector 
arm. Primary hybridization-positive clones, determined by 
plaque hybridization with a randomly labeled 83 8 bp FIV gag 
PCR product probe, were screened further by PCR as 
20 described in Example 1. Five microliters of phage plaque 
suspensions of each hybridization-positive clone was 
directly amplified with six different primer sets, and a 
full-length proviral clone was identified (designated 
JSY3). The specificity of each FIV PCR product was 
25 established by comparing it with the FIV-pPPR plasmid clone 
(Phillips et al.,- J. Virol. 64:4605 (1990)). 

The genomic proviral insert was subcloned into pJEM 
vectors, and the provirus gehome was sequenced by primer 
directed sequencing, using techniques as are known in the 
30 art. Nucleotide and predicted amino acid sequences were 
computer analyzed, and open reading frames (orfs) were 
identified. 

The provirus DNA sequence of the JSY3 provirus clone of 
FIV-NCSUi is provided in Figure 1, with the group specific 
35 antigen (gag) , polymerase (pol) , and envelope protein (env) 
regions marked. As shown in Figure 1, the DNA sequence 
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consists of 9471 base pairs (SEQ ID NO:l) . 

The coding region of gag is nucleotides 631-1980 of SEQ 
ID NO:l (SEQ ID NO:4) and encodes a 450 amino acid product 
(SEQ ID NO:2) . 

5 The coding region for the pl5 protein is nucleotides 

631-1035 of SEQ ID NO : 1 (SEQ ID NO:5), with a polypeptide 
product of 135 amino acids (SEQ ID NO: 6). 

The coding region for the p25 protein is nucleotides 
1036-1704 of SEQ ID NO : 1 (SEQ ID NO:7), with a polypeptide 
10 product of 223 amino acids (SEQ ID NO: 8) . 

The coding region for the p24a protein is nucleotides 
1264-1305 of SEQ ID NO:l (SEQ ID-NO:9), with a polypeptide 
product of 14 amino acids (SEQ ID NO: 10) . 

The coding region for the plO protein is nucleotides 
15 1717-1980 of SEQ ID NO:l (SEQ ID NO:ll), with a polypeptide 
product of 88 amino acids (SEQ ID NO:12) . 

The coding region' of pol is amino acids 2151-5991 of 
SEQ ID NO:l (SEQ ID NO: 13) . Two open reading frames (orfs) 
are found in the pol region. Orf 1 is nucleotides 2151- 
20 5243 of SEQ ID NO:l (SEQ ID NO: 14), encoding a product of 
1031 amino acids (SEQ ID NO:15); Orf 2 is nucleotides 5239- 
5991 of SEQ ID NO : 1 (SEQ ID NO: 16) and encodes a product of 
251 amino acids (SEQ ID NO:17). 

The env coding region is nucleotides 6269-8824 of SEQ 
ID NO:l (SEQ ID NO:18) and encodes a protein of 852 amino 
acids (SEQ ID NO:3) . The transmembrane (TM) peptide is 
encoded by nucleotides 8339-8374 of SEQ ID NO:l (SEQ ID 
N0:19), and is 12 amino acids in length (SEQ ID NO:20). 

Figure 2 aligns the gag open reading frames of the JSY3 
30 clone of NCSU, (FIV-NCSU) with known FIV isolates FIV PPR, 
FIV Zl, FIV CG, FIV 14, FIV TM1, and FIV TM2 . Figure 3 
aligns the whole envelope protein sequence of clone JSY3 of 
NCSU, with known FIV isolates FIV 14, FIV Zl, FIV CG, FIV 
19k, and FIV PPR. 
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EXAMPLE 3 

Biological Activity of JSY3 
To determine the biological activity of the JSY3 clone, 
lambda DNA was transfected into CrFK, AH927, and FCD4E 
5 cells, which were then cocultured with FCD4E cells or 
PBMCs . While no RT activity was detected in culture 
supernatants of JSY3 -transfected CrFK or AH927 cells when 
cultured alone, RT activity was detected when the 
transfected cells were cocultured with either PBMCs or 

10 FCD4E cells (data not shown) . The replication kinetics of 
FIV in FCD4E cells is more rapid than in PBMCs because of 
the greater percentage of CD4 + cells in the FCD4E culture. 
Supernatants collected at 15 and 19 days of culture from 
FCD4E cells were filtered (0.2 /im pore size) and stored in 

15 aliquots for use an in vitro and in vivo inocula. These 
inocula were designated the FIV-NCSU^ JSY3 clone. No RT 
activity was detected in the FCD4E cultures directly 
transfected with JSY3 , suggesting that the transfection was 
unsuccessful (data not shown) . 

2 0 To determine the in vitro infect ivity of the JSY3 

clone, FCD4E and CrFK cells were inoculated with cell -free 
JSY3 clone. Similarly to the FIV-NCS^ wild-type virus 
(English et al., J. Virol. 67:5175 (1993)), the JSY3 clone 
replicated efficiently in FCD4E cells, resulting in 
25 syncytium formation and cell death (data not shown) . 
However, -the JSY3 clone was unable to infect CrFK cells. 

EXAMPLE 4 

Tn vivo InfectfivitY of JSY3 
To determine the in vivo infect ivity of the JSY3 
30 molecular clone, six SPF cats were inoculated intravenously 
with 10 s TCID 50 of JSY3 clone. Nine age-matched SPF cats 
were inoculated with 10 6 TCID 50 s of FIV-NCSU^ also produced 
in FCD4E cells. Plasma and PBMCs were collected at various 
times post infection, and tested for antibodies to FIV by 
3 5 Western blotting and tested for cell-associated FIV 
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provirus by PCR. As previously reported (English et al., 
J. Infect. Dis. 170:543 (1994); Tompkins et al., J . Am. 
Vet. Med. Assoc. 199:1311 (1991)) cats infected with FIV- 
NCSUi parent virus were anti-FIV positive by 4 weeks post 
5 infection and were provirus positive by PCR by 2 weeks post 
infection (data not shown) . 

The response of cats infected with the JSY3 clone was 
similar to that of the cats infected with the wild- type. 
By four weeks post infection, all six cats had antibody to 
10 the FIV gag proteins pl7 and p24, and they were still 
antibody positive at 25 weeks post infection (data not 
shown) . The presence of FIV provirus in PBMCs from six 
cats infected with the JSY3 clone was determined by PCR and 
southern analysis. A PBMC lysate (equivalent to 10 4 cells) 
15 was amplified with the gag region primer set 919U-1756L, 
resolved on an agarose gel, and subjected to Southern blot 
analysis with a 5 ' -end- labeled internal probe. Provirus 
was detected in PBMCs from all cats by two weeks post 
infection (data not shown) . All cats remained provirus 
20 positive when the amount of cell lysate in the PCR mixture 
was increased (data not shown) . 

To establish the presence of infectious virus in PBMCs 
from the JSY3- infected cats, PBMCs collected at 6 and 25 
weeks post infection were cocultured with FCD4E cells and 
25 the supernatant s were assayed for RT activity. Syncytium 
formation and cell death were observed in cocultures from 
all six cats at both six and 25 weeks p.i. RT activity was 
detectable in all cocultures by 8 to 10 days and peaked by 
16 to 18 days of culture (dat& not shown) . 

30 EXAMPLE 5 

L ymphocyte Subset Changes in JSY3- infected Cats 

Lymphocyte profiles in naturally and experimentally 
FIV- infected cats are well documented (Ackley et al . , J. 
Virol. 64:5652 (1990); English et al M *7. Infect. Dis. 
35 170:543 (1994); Hoffmann- Fezer et al . , J". Virol. 66:1484 
(1992); Novotney et al . , AIDS 4:1213 (1990); Tompkins et 
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al., J". Am. Vet. Med. Assoc. 199:1311 (1991)). To 
determine whether the JSY3 clone causes hematologic and 
immunologic abnormalities similar to those of the 
biological parent FIV-NCSU X , lymphocyte subset profiles were 
5 analyzed by two-color flow cytometry. As reported for NCSU X 
(English et al . , J. Infect. Dis. 170:543 (1994); Tompkins 
et al., J. Am. Vet. Med. Assoc. 199:1311 (1991)), both the 
biological virus and the JSY3 clone caused a panlymphopenia 
two to four weeks p.i. The parent FIV-NCS^ and the JSY3 
10 molecular clone caused parallel alterations in the 
CD4 + /CD8+ ratio (data not shown). At six weeks p.i., the 
mean CD4+/CD8+ cell ratios (± standard errors) decreased 
from 3.48 ± 0.50 to 1.30+ 0.21 for the parent virus- 
infected cats. By using total cell counts and flow 
15 cytometric analysis of lymphocyte subsets, the decrease in 
the CD4+/CD8+ ratio was determined to be the result of a 
decrease in CD4+ lymphocytes and an increase in CD8+ 
lymphocytes (data not shown) . These results indicate that 
the JSY3 clone- infected cats have hematologic and 
20 immunologic abnormalities, including CD4+ and CD8 + 
lymphocyte changes similar to those of cats infected with 
the biological parent virus. 

EXAMPLE 6 

Tn vivo Lymphocyte Tropism 
25 The in vivo hematopoietic target cells of FIV isolates, 

including NCSU lf have been reported to be CD4+, CD8+, 
monocytes, and B. lymphocytes (Beebe et al . , J. Virol. 
68:3080 (1994); Brown et ali, J. Virol. 65:3359 (1991); 
English et al., J. Virol. 67:5175 (1993)). To determine 
3 0 whether the JSY3 molecular clone has a similar 
panlymphotropism in vivo, PBMCs from JSY3 clone infected 
cats were sorted into CD4+, CD8 + , and B lymphocyte 
populations using antibody-coated magnetic beads. Each 
cell subset was lysed, PCR amplified with the gag region 
35 919U-1756L primer set, and analyzed by Southern blotting. 

As previously reported for the NCSt^ parent virus, FIV 
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provirus was first detected in CD4+ lymphoctyes during the 
acute-stage infection with JSY3 (2 to 4 weeks p. i.) (data 
not shown) . At a later stage of infection (as early as 14 
weeks p.i.), FIV provirus was found in CD8+ and B 
5 lymphocytes in addition to CD4+ lymphocytes, as reported 
for FIV-NCSU, (English et al., J • Virol. 67:5175 (1993)). 
All six JSY3- infected cats showed similar shifts in 
provirus burden from predominately CD4+ cells during the 
acute-stage infection to predominately B cells during the 
10 asymptomatic stage. While CD4 + and CD8 + cells were not 
always positive for provirus under PCR conditions described 
in Example 1, provirus was always able to be detected in 
these cells during the asymptomatic -stage infection by 
increasing cell numbers or using nested primers as 
15 described by English et al., J. Virol. 67:5175 (1993) . The 
JSY3 molecular clone, similar to the parent biological 
isolate, exhibits a CD4+ tropism during the acute-stage 
infection that then shifts to a panlympho tropism as the 
infection progresses. 
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EXAMPLE 7 

■TSVT- Infe rl-.ed Cats 
Acute-stage disease. In the primary phase of infection 
(2 to 16 weeks p.i.), both the JSY3- and the parent 
isolate-infected cats developed low-grade fevers, 
panlymphopenia, neutropenia, and generalized 
lymphadenopathy (data not shown) , as has been reported for 
a number of biological isolates of FIV (Yamamoto et al., 
Am. J. Vet. Res. 49:1246 (198B) ) , including NCSU, (English 
et al., J- Infect. Dis. 170:543 (1994)). 

Clinical response of JSY3 -infected cats to T. gondii 
challenge. Davidson et al., (Am. J. Pathol. 143:1486 
(1993)) reported that FIV-NCSU, causes immune system 
impairment in cats as early as eighteen weeks after 
infection and enhances susceptibility to a primary t. 
gondii infection. To determine if the molecular clone JSY3 
caused immune impairment early in the asymptomatic stage of 
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infection, the cats were parenterally inoculated with the 
ME49 strain of T. gondii 29 weeks after JSY3 infection, 
six age-matched SPF control cats were similarly infected 
with r. gondii. At the time of T. gondii inoculation, all 
six FIV-infected cats were clinically normal; however, they 
had a marked decrease in their CD4 + /CD8+ ratios in 
comparison with preinfection ratios and those of the 
control cats (data not shown) . Only one of six T. gondii- 
infected cats in the non-FIV- inoculated group had positive 
clinical scores, as a result of anorexia and lethargy on 
days 8 to 11 after inoculation. Cats in this group also 
developed multifocal chorioretinitis beginning on days 7 to 
10 after inoculation, which resolved over a three week 
course. The infection was otherwise subclinical in these 
cats. This clinical response is similar to that previously 
reported for healthy cats challenged with the mildly 
virulent ME49 strain of T. gondii (Davidson et al . , Invest. 
Ophthalmol. Visual Sci. 34:3653 (1993); Davidson et al . , 
Am. J. Pathol. 143:1486 (1993)).. 

Five of the six FIV-positive cats challenged with t. 
gondii had positive clinical scores in all three categories 
(attitude, appetite, and respiratory signs) , and the total 
scores were higher than those of the T. gondii control 
group. Beginning on days 6 to 9 after inoculation, three 
i FIV-infected cats challenged with T. gondii developed high 
fevers, depression, and moderate to severe ocular lesions, 
including chorioretinitis with subretinal granuloma 
formation, localized ' retinal detachment, and fibrinous 
anterior uveitis. Severe * and progressive tachypnea, 
D dyspnea, tachycardia, and icterus were noted, and 
interstitial and consolidated lung sounds were auscultated. 
These three cats were euthanized when moribund on day 9 or 
10 after inoculation. Two of the three remaining cats 
developed mild to moderate clinical toxoplasmosis but 
5 recovered. This clinical course of T. gondii infection in 
JSY3 infected cats, including the high morbidity, was 
similar to that reported by Davidson et al . (Am. J. Pathol. 
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143:1486 (1993)) for cats infected with NCSU X . 

Postmortem findings. Postmortem exams were performed 
on the three FIV-T. gondii -inf ected cats that were 
euthanized to confirm that their clinical disease was due 
to toxoplasmosis. One cat had gross evidence of 
interstitial pneumonia. All three animals had foci of 
discoloration in the liver consistent with hepatic 
necrosis, and the hearts contained foci of myocardial 
necrosis. Histologically, lesions were present m the 
!ungs, livers, hearts, and brains of the three cats, and 
were similar to those seen in cats with dual FIV-NCSU.-T. 
gondii infection as described by- Davidson et al., (An. J. 
Pathol 143:1486 (1993)). Except for the heart, T. gondix 
tachyzooites were seen in all tissues examined. The 
tachyzooites were never numerous but most conspicuous as 
clusters inside of macrophages in the regions of severe 
inflammation and necrosis in the brain, lung, and liver. 

The foregoing examples are illustrative of the present 
invention, and are not to be construed as limiting thereof 
The invention is defined by the following claims, with 
equivalents of the claims to be included therein. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Tompkins. Wayne A. F. 

Tompkins. Mary B. 
Yang. Joo-Sung 

(ii) TITLE OF INVENTION: Feline Immunodeficiency Virus Clone 
(iii) NUMBER OF SEQUENCES: 33 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Bell Seltzer Park & Gibson 

(B) STREET: PO Drawer 34009 

(C) CITY: Charlotte 

(D) STATE: North Carolina 

(E) COUNTRY: USA 

(F) ZIP: 28234 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0. Version #1.30 

(vi ) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Bennett. Virginia C. 

(B) REGISTRATION NUMBER: 37.092 

(C) REFERENCE/ DOCKET NUMBER: 5051-332 

(ix) TELECOMMUNICATION INFORMATION: 

(A) " TELEPHONE: 919-420-2200 

(B) TELEFAX: 919-881-3175 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 631.. 1980 
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(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 6269.-8824 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

TGGGATGAGT ATTGGGACCC TGAAGAAATA GAAAGAATGC TTATGGACTA GTGACTGTTT 60 

ACGAACAAAT GATAAATGAT GGAAACAGCT GAGCATGACT CATAGTTAAA GCGCTAGCAG 120 

CTGCTTAACC GCAAAACCAC ATCCTATGTA AAGCTTGCTG ATGACGTATA ATTTGCTCCA 180 

CTGTAAAAGT ATATAACCAG TGCTTTGTGA GACTTCGGGG AGTCTCTCCG TTGAGGACTT 240 

TCGAGTTCTC CCTTGAGGCT CCCACAGATA CAATAMTAT TTGAGATTGA ACCCTGTCAA 300 

GTATCTGTGT AATCTTTTTT ACCTGTGAGG TCTCGGAATC CGGGCCGAGA ACTTCGCAGT 360 

TGGCGCCCGA ACAGGGACTT GATTGAGAGT GATTGAGGAA GTGAAGCTAG AGCAATAGAA 420 

AGCTGTTAAG CAGAACTCCT GCTGACCTAA ATAGGGAAGC AGTAGCAGAC GCTGCTAACA 480 

GTGAGTATCT CTAGTGAAGC AGACTCGAGC TCATAATCAA GTCACTGTTT AAAGGCCCAG 540 

ATAAATTACA TCTGGTGACT CTTCGCGGAC CTTCAAGCCA GGAGATTCGC CGAGGGACAG 600 

TCAACAAGGT AGGAGAGATT CTGCAGCAAC ATG GGG AAC GGA CAG GGG CGA GAT 654 

1 5 

TGG AAA ATG GCC ATT AAG AGA TGT AGT AAT GTT GCT GTA GGA GTA GGG 702 
Trp Lys Met Ala He Lys Arg Cys Ser Asn Val Ala Val Gly Val Gly 
10 15 ^ 

err AAG AGT AAA AAA TTT GGA GAA GGG AAT TTC AGA TGG GCC ATT AGA 750 
Gly £5s s£ f£ $ Phe GlV 61u Gly Asn Phe Arg Trp Ala lie Arg 
25 30 Jt> 

ATP rn aat GTA TCT ACA GGA CGA GAA CCT GGT GAT ATA CCA GAG ACT" 798 
Hrt All ten vll Se T r Th? ™ A?S Glu Pro Gly Asp He Pro Glu Thr 
45 50 00 



TTA TAT CAA CTA AGG TTG GTT ATT TGC GAT TTA CAA GAA AGA AGA GAA 846 
12 Asp GlS 25 Arg S Val He Cys Asp Leu Gin Glu Arg Arg Glu 
60 65 /u 

AAA TTT GGG TCG AGC AAA GAA ATT GAC ATG GCA ATT GTT ACA TTA AAA 894 
Lys Phe Gly Ser Ser Lys Glu He Asp Met Ala He Val Thr Leu Lys 
75 80 b3 

rrr rT . rTA GGA CTT tja AAT ATG ACA GTG TCT ACT GCT GCT 942 
vll SI Ala V™ vll Gly lS Le2 Asn Met Thr Val Ser Thr Ala Ala 
90 95 100 

GCA GCT GAA AAT ATG TAC ACT CAG ATG GGA TTA GAC ACT AGA CCA TCT 990 
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Ala Ala Glu Asn Met Tyr Thr Gin Met Gly Leu Asp Thr Arg Pro Ser 
105 HO I 15 120 

ATG AGA GAA GCA GGA GGA AAA GAG GAA AGC CCT CCA CAG GCA TCT CCT 1038 
Met Arg Glu Ala Gly Gly Lys Glu Glu Ser Pro Pro Gin Ala Ser Pro 
125 130 

ATT CAA ACA GCA AAT GGA GCA CCA CAA TAT GTA GCA CTT GAC CCA AAA 1086 
lie Gin Thr Ala Asn Gly Ala Pro Gin Tyr Val Ala Leu Asp Pro Lys 
140 145 laU 

ATG GTG TCC ATT TTT ATG GAA AAG GCA AGA GAA GGA TTA GGA GGT GAG 1134 
Met Val Ser lie Phe Met Glu Lys Ala Arg Glu Gly Leu Gly Gly Glu 
155 160 I 65 

GAA GTT CAG CTA TGG TTT ACT GCC TTC TCT GCA AAT TTA ACA CCT ACT 1182 
Glu Va Gin Leu Trp Phe Thr Ala Phe Ser Ala Asn Leu Thr Pro Thr 
170 175 180 

cat ATG GCC ACA TTA ATA ATG GCC GCA CCA GGG TGC GCT GCA GAT AAA 1230 
Asp mS Ala Th? LeS Tie Met Ala Ala Pro Gly Cys Ala Ala Asp Lys 
185 190 195 200 

GAA ATA TTG GAT GAA AGC TTA AAG CAA TTG ACG GCA GAG TAT GAT CGT 1278 
Glu lie Leu Asp Glu Ser Leu Lys Gin Leu Thr Ala Glu Tyr Asp Arg 
205 210 ^ 

ACC CAT CCT CCT GAT GGA CCT AGA CCA TTA CCC TAT TTT ACT GCA GCA 1326 
Thr His Pro Pro Asp Gly Pro Arg Pro Leu Pro Tyr Phe Thr Ala Ala 
220 225 " u 

GAA ATT ATG GGT ATA GGA TTA ACT CAA GAA CAA CAA GCA GAA GCA AGA 1374 
Glu lie Met Gly lie Gly Leu Thr Gin Glu Gin Gin A a Glu Ala Arg 
235 240 245 

TTT GCA CCA GCT AGG ATG CAG TGT AGA GCA TGG TAT CTC GAG GCA CTA 1422 
Phe Ala Pro Ala Arg Met Gin Cys Arg Ala Trp Tyr Leu Glu Ala Leu 
250 255 260 

GGA AAA TTG GCC GCC ATA AAA GCT AAG TCT CCT CGA GCT GTG CAG TTA 1470 
Gly Lys Leu Ala Ala He Lys Ala Lys Ser Pro Arg Ala Val Gin Leu 
265 270 27 5 280 

AGA CAA GGA GCT AAG GAA GAT TAT TCA TCC TTT ATA GAC AGA TTG TTT 1518 
Arg GlS GlJ Ala LVs Glu Asp Tyr Ser Ser Phe He Asp Arg Leu Phe 
285 290 

GCC CAA ATA GAT CAA GAA CAA AAT ACA GCT GAA GTT AAG TTA TAT TTA 1566 
Ala Gin lie Asp Gin Glu Gin Asn Thr Ala Glu Val Lys Leu Tyr Leu 
300 305 JiU 

AAA CAG TCA TTA AGC ATG GCT AAT GCT AAT GCA GAA TGT AAA AAG GCA 1614 
tyt Gin Ser Leu Ser Met Ala Asn Ala Asn Ala Glu Cys Lys Lys Ala 
315 320 325 

ATG AGC CAC CTT AAG CCA GAA AGT ACC CTA GAA GAA AAG CTG AGA GCT 1662 
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Met Ser His Leu Lys Pro Glu Ser Thr Leu Glu Glu Lys Leu Arg Ala 
330 335 

TGT CAA GAA GTA GGC TCA CCA GGA TAT AAA ATG CAA CTC TTG GCA GAA 1710 
Cys Gin Glu Val Gly Ser Pro Gly Tyr Lys Met Gin Leu Leu Ala Glu 
345 350 ^ 

fTT CTT ACA AAA GTT CAA GTA GTG CAA TCA AAA GGA TCA GGA CCA GTG 1758 
All S Thf lJS vll Gin Val Val Gin Ser Lys Gly Ser Gly ro Val 
365 370 o/o 

TGT TTC AAC TGT AAA AAA CCA GGA CAT CTA GCA AAA CAG TGT AGA GAT 1806 
SI Se ten Cys \?s Lys Pro Gly His Leu Ala Lys Gin Cys Arg Asp 
380 385 ™ 

rrr aaa AAA TGT AAT AAA TGT GGA AAG CCT GGT CAT TTA GCT GCC AAA 1854 
vll £yt f£ SI S£ LyV Cyl Gly Lys Pro Gly His Leu Ala Ala Lys 
395 400 4U5 

rrr irr taa fifiT GGT AAA AAG AAT TCG GGA AAC TGG AAG GCG GGG CGA 1902 
Ss T?p Gift Gly GlJ \£ C5S Tsn Ser Gly Asn Trp Lys Ala Gly Arg 
410 415 ^ 

rn rrA fifC CCA GTG AAT CAA GTG CAG CAA GCA GTA ATG CCA TCT GCA 1950 
All Ala Ala Pro Val Z Gin Val Gin Gin Ala Val Met Pro Ser A a 

425 430 

CCT CCA ATG GAG GAG AGA CTA TTG GAT TTA TAAATTATAA TAAAGTAGGT 2000 
Pro Pro Met Glu Glu Arg Leu Leu Asp Leu 
445 450 

ACTACTACAA CATTAGAAAA GAGGCCAGAA ATACTTATAT TTGTAAATGG GTACCCTATA 2060 

AAATTTTTAT TAGATACAGG AGCAGATATA ACAATTTTAA ATAGGAGAGA TTTTCAAGTA 2120 

AAAAATTCTA TAGAAAATGG AAGGCAAAAT ATGATTGGAG TAGGAGGAGG AAAGAGAGGA 2180 

ACAAATTATA TCAATGTGCA TTTAGAGATT AGAGATGAAA ATTATAAGAC ACAATGTATA 2240 

TTTGGCAATG TTTGTGTCTT AGAAGATAAC TCATTAATAC AACCATTATT AGGGAGAGAT 2300 

AATATGATTA GATTCAATAT TAGGTTAGTA ATGGCTCAAA TTTCTGACAA GATTCCAATA 2360 

GTAAAAGTAA AAATGAAGGA TCCAAATAAA GGACCTCAAA TAAAACAATG GCCATTAACA 2420 

AATGAAAAAA TTGAAGCTTT AACAGAAATA GTAGAAAGAC TAGAAAGAGA AGGGAAAGTA 2480 

AAAAGAGCAG ATCCAAATAA CCCATGGAAT ACACCAGTAT TTGCAATAAA AAAGAAAAGT 2540 

GGAAAATGGA GAATGCTCAT AGATTTTAGA GAATTGAACA AATTAACTGA GAAAGGGGCA 2600 

GAAGTCCAGT TAGGACTCCC TCATCCTGCT GGATTAAAAA TGAAAAAACA AGTTACTGTG 2660 

CTAGATATAG GAGATGCATA CTTCACTATT CCCTTGGATC CAGACTATGC TCCCTATACT 2720 

GCATTCACAT TACCTAGAAA GAATAATGCA GGACCAGGGA GGAGATATGT ATGGTGCAGT 2780 
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TTACCACAGG GGTGGGTTCT AAGCCCATTG ATATATCAAA GTACTTTAGA TAATATAATA 2840 

CAACCTTTTA TTAGACAAAA TCCTGAGTTA GATATTTATC AATATATGGA TGACATTTAT 2900 

ATAGGATCAA ACTTAAGTAA AAAGGAGCAT AAAGAAAAAG TAGAAGAATT AAGAAAATTG 2960 

TTATTATGGT GGGGATTTGA AACCCCGGAA GACAAATTAC AAGAAGAGCC CCCATATAAG 3020 

TGGATGGGCT ATGAATTACA TCCATTAACA TGGTCAATAC AGCAAAAACA ATTAGAAATT 3080 

CCAGAAAGAC CCACATTAAA TGAACTGCAG AAATTAGCAG GTAAGATAAA CTGGGCCAGT 3140 

CAAACTATCC CAGACTTAAG TATAAAAGAA CTAACTAACA TGATGAGAGG AGATCAGAAG 3200 

TTAGACTCAA TAAGAGAATG GACTGTGGAA GCCAAGAGAG AAGTACAAAA AGCTAAGGAA 3260 

GCTATTGAGA TGCAAGCACA GCTAAATTAT TATGATCCCC ACCGAGAATT ATATGCAAAA 3320 

TTAAGTTTAG TGGGACCACA TCAAATATGT TATCAAGTGT ATCATAAGAA CCCAGAATGT 3380 

ATTTTATGGT ATGGTAAGAT GAATAGACAA AAGAAAAAGG CAGAAAATAC CTGTGATATA 3440 

GCTCTAAGGG CATGTTATAA AATAAGAGAA GAATCTATTA TAAGAATAGG AAAAGAACCA 3500 

ATATATGAAA TACCTACTTC TAGAGAAGCC TGGGAGTCAA ATTTAATTAA TTCACCATAT 3560 

CTTAAGGCCC CACCTCCTGA GGTAGAATAT ATCCATGCTG CTGTGAATAT AAAAAGAGCA 3620 

TTAAGTATGA TAAAAGATGT TCCAATACCA GAAGCAGAAA CGTGGTATAT AGATGGAGGC 3680 

AGAAAGCTAG GAAAAGCAGC AAAAGCAGCC TATTGGACAG ATACAGGGAA GTGGCAAGTA 3740 

ATGGAGTTAG AAGGCAGTAA TCAGAAGGCA GAAGTACAAG CATTATTATT GGCATTAAAA 3800 

GCAGGATCAG AGGAAATGAA TATTATAACA GATTCACAAT ATGTTATAAA TATTATTCTT 3860 

CAACAACCAG ATATGATGGA GGGAATCTGG CAAGAAGTTT TAGAAGAATT GGAGAAAAAA 3920 

ACAGCAATAT TTATAGATTG GGTCCCAGGA CATAAAGGTA TTCCAGGAAA TGAGGAAGTA 3980 

GATAAGCTTT GTCAAACAAT GATGATAATA GAAGGGGATG GGATATTAGA TAAAAGGTCA 4040 

GAAGATGCGG GATATGATTT ATTGGCTGCA AAAGAAATAC ATTTATTGCC AGGAGAGGTA 4100 

AAAGTAATAC CAACAGGGGT AAAGCTAATG CTGCCTAAAG GACATTGGGG ACTAATAATG 4160 

GGAAGAAGCT CGATAGGGAG TAAAGGATTG GATGTATTAG GAGGGGTAAT AGATGAAGGA 4220 

TATCGAGGTG AAATTGGAGT AATAATGATT AATGTATCAA GAAAATCAAT CACCTTAATG -4280 

GAACAACAAA AGATAGCACA ATTAATAATA TTGCCTTGTA AACATGAAGT ATTAGAACAA 4340 

GGAAAAGTTG TAATGGATTC AGAGAGAGGA GACAAAGGTT ATGGGTCAAC AGGAGTATTC 4400 

TCCTCTTGGG TTGACAGGAT TGAGGAAGCA GAAATAAATC ATGAAAAATT TCACTCAGAT 4460 
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CCACMTACT TAA6GACTGA ATTTAATTTA CCCAAGAT6G TTGCAGAAGA GATAAGACGA 4520 

AAGTGCCCTG TATGTAGAAT CAGAGGAGAA CAAGTGGGAG GACAATTGAA AATAGGGCCT 4580 

GGAATATGGC AAGTGGATTG CACACACTTT AATAGTAAGA TAATCATTGT AGCAGTACAT 4640 

GTGGAATCAG GATTTTTATG GGCACAGATA ATTCCACAGG AGACTGCAGA TTGTACAGTC 4700 

AAGGCTCTTC TGCAACTTAT ATGTGCTCAT AATGTTACAG AATTACAAAC AGACAATGGA 4760 

CCAAATTTTA AAAATCAGAA AATGGAAGGT TTATTAAATT TTATGGGAAT AAAACATAAA 4820 

TTAGGGATAC CAGGTAACCC ACAATCACAG GCATTAGTGG AAAATGCTAA TAACACATTA 4880 

AAAGCTTGGA TTCAAAAATT CCTACCAGAG ACTACCTCTC TGGATAATGC TCTGGCCCTA 4940 

GCCCTGTATA GTCTCAACTT TAAACAAAGG GGTAGACTAG GAAGGATGGC CCCTTATGAA 5000 

TTATACATAC AACAAGAATC ATTAAGAATA CAAGACTATT TTTCGCAGAT TCCACAAAAG 5060 

TTAATGATGC AGTGGGTGTA TTACAAAGAT CAAAAAGACA AAAAATGGAA GGGACCAATG 5120 

AGAGTGGAAT ATTGGGGACA AGGATCAGTA TTATTAAAGG ATGAAGAGAA GGGATATTTT 5180 

CTTGTACCTA GGAGACACAT AAGAAGAGTC CCAGAACCCT GCACTCTTCC TGAAGGGGAT 5240 

GAGTGACGAA GATTGGCAGG TAAGTAGAAG ACTCTTTGCA GTGCTCCAAG GAGGAGTACG 5300 

TAGTGCTATG CTATACATAT CTAGACTACC TCCGGACGM AGAGAAAGGT ATAAAAAAGA 5360 

CTTTAAGAAA AGGCTTTTGG AAAAGGAAAC AGGATTCATA CAGAGATTAA GAAAAGCGGA 5420 

AGGAATAAGG TGGAGCTTCC ATACTAGAGA TTATTATATA GGATATGTAA GAGAGATGGT 5480 

GGCCGGATCT AGTCTACCAG ATAGTTTAAG ACTGTATATT TATATAAGCA ATCCATTGTG 5540 

GCACTGGTCA TACCGTCCTG GCCTGACAAA TTTTAATACA GAATGGCCTT TTGTGAATAT 5600 

GTGGATAAAG ACAGGATTCA TGTGGGATGA TATTGAAAGC CAGAATATTT GCAAAGGAGG 5660 

AGAGATTTCA CATGGATGGG GACCTGGMT GGTGGGAATT GTGATAAAAG CTTTTAGTTG- 5720 
TGGAGAAAGA AAGATTGAGG CTACTCCTGT AATGATTAfiA AGAGGAGAAA TAGATCCAAA 
AAAATGGTGT GGAGATTGTT GGAATTTGAT GTGTCTTAGG AACTCACCTC CACAGACTTT 

ACAAAGACTT GCTATGTTGG CATGTGGCGT GCCGGCTAAG GAGTGGCGAG GATGCTGTAA 5900 

TCAACGCTTT GTTTCTCCTT ACAGAACGCC TGCTGATTTG GAGGTCATTC AATCCAAGCC 5960 

CAGCTGGAGT CTATTATGGT CAGGGAGCCT ATGAATGGM GACATACTAA CATTATTTAA 6020 

TAAGGTCACT AAGAAACTAG AAMGGAAAA AGCTATCAGA ATATTTGTAT TAGCACATCA 6080 

ATTAGAAAGG GACAAAGTTA TTAGATTACT ACAAGGATTA GTTTGGAGAC ATAGATTTAA 6140 



5780 
5840 
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GAAACCCCM ACAAAATACT GTTTATGTTG GTTCTGTTGC AAATTCTACT ATTGGCAGTT 6200 

GCAATCTACA TTATCAATAA CTACTGCTTA GAAATACTTA TAATAATATT TCATTTGCAA 6260 

CAATAATT ATG GCA GAA GGA TTT GCA GCC AAT AGA CAA TGG ATA GGA CCA 6310 
Met Ala Glu Gly Phe Ala Ala Asn Arg Gin Trp He Gly Pro 
1 5 10 

GAA GAA GCT GAA GAG TTA TTA GAT TTT GAT ATA GCA ACA CAA ATG AAT 6358 
Glu Glu Ala Glu Glu Leu Leu Asp Phe Asp lie Ala Thr Gin Met Asn 
15 20 25 30 

GAA GAA GGG CCA CTA AAT CCA GGG ATG AAC CCA TTT AGG GTA CCT GGA 6406 
Glu Glu Gly Pro Leu Asn Pro Gly Met Asn Pro Phe Arg Val Pro Gly 
35 40 45 

ATA ACA GAT AAA GAA AAG CAA GAC TAT TGT AAC ATA TTA CAA CCT AAG 6454 
lie Thr Asp Lys Glu Lys Gin Asp Tyr Cys Asn He Leu Gin Pro Lys 
50 55 . dO 



ATT TTT TGG GAT TGT TGG GCA LLA uAA baa ^ mi w« «™ - ■ • 
He Phe Trp Asp Cys Trp Ala Pro Glu Glu Pro Ala Cys Gin Asp Phe 

" 200 c\jo 



TTA CAA GAT TTA CGG AAT GAA CTT CAA GAG GTA AAA CTA GAA GAA GGA 6502 
Leu Gin Asp Leu Arg Asn Glu Leu Gin Glu Val Lys Leu Glu Glu Gly 
65 70 '5 

AAT GCA GGT AAG TTT AGA AGG GCA AGA TAT TTA AGA TAT TCT GAT GAA 6550 
Asn Ala Gly Lys Phe Arg Arg Ala Arg Tyr Leu Arg Tyr Ser Asp Glu 
80 85 

AAT GTG CTA TCT ATA GTC TAT TTG CTA ATA GGA TAT CTA AGA TAT TTA 6598 
Val Leu Ser He Val Tyr Leu Leu lie Gly Tyr Leu Arg Tyr Leu 
95 100 105 iiU 

ATA AAT CGT AGG AGT TTA GGA TCT TTA AGA CAT GAT ATA GAC ATA GAA 6646 
He Asn Arg Arg Ser Leu Gly Ser Leu Arg His Asp He Asp He Glu 
115 120 J-" 

ACA CCT CAA GAG GAA TAT TAT AGT AAT AGT GAA AGG GGT ACC ACA TTA 6694 
Thr Pro Gin Glu Glu Tyr Tyr Ser Asn Ser Glu Arg Gly Thr Thr Leu 
130 135 i4U 

AAT CAA AAA TAT GCG AGA AGA TGT TGT GTT AGC ACA CTT ATT ATG TAT 6742 
Asn Gin Lys Tyr Ala Arg Arg Cys Cys Val Ser Thr Leu He Met Tyr 
145 150 i ibb 

TTA An CTT TTT GCA GTA GGC ATC TGG TGG GGA GCT AGA GCA CAA GTA 
Leu He Leu Phe Ala Val Gly lie Trp Trp Gly Ala Arg Ala Gin Val 
160 165 1/u 

rrr irr, AGA CTT CCC CCT TTA GTA GTT CCA GTA GAA GAA TCA GAA ATA 
vll Trp Arg SI Pro Pro Le£ Val Val Pro Val Glu Glu Ser Glu e 
175 180 185 19U 

ATT TTT TGG GAT TGT TGG GCA CCA GAA GAA CCC GCC TGT CAA GAC TTT 
Cys 
195 



6790 



6838 



6886 
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210 ' 1S 

225 ^ 
240 245 
255 260 



305 31U 



ACA GAC CCA TTA CM ATC CCA CTG ATC AAT TAT ACA TTT GGA CCT AAT 
Thr Asp Pro Leu Gin He Pro Leu lie Asn lyr 

320 325 
CAA ACA TGT ATG TGG AAC ACT TCA CAA ATT CAG GAC CCT GAG ATA CCA 
Gin Thr Cys Met Trp Asn Thr Ser Gin lie bin asp 350 
335 340 



GGA ACA TGG CTT AGA GCA ATC TCG TCA TGG AGA CAA AGG AAT AGA TGG 
Gly Thr Trp Leu Arg Ala He Ser Ser irp *rg u 
385 Jyu 



GAA TGG AGA CCA GAT TTT GAA AGT GAA AAG 6T6 AM ATA TCT CTA AAG 

Glu Trp Arg Pro Asp Phe Glu Ser faiu Lys j 
400 40b 



6934 

6982 

7030 

7078 

7126 

7174 

7222 

7270 

7318 

7366 

7414 

7462 

7510 

7558 
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TAT GGA GAA GTA ACG GGA GCT TGG ATA GAG TTT GGA TGT CAT AGA AAT 7606 
Tyr Glv Glu Val Thr Gly Ala Trp He Glu Phe Gly Cys His Arg Asn 
435 440 445 

AAA TCA AAA CTT CAT GAT GAA GCA AGG TTT AGA ATT AGA TGT AGA TGG 7654 
Lys Ser Lys Leu His Asp Glu Ala Arg Phe Arg He Arg Cys Arg Trp 
450 455 460 

AAT ATA GGG GAG AAT ACC TCA CTC ATT GAT ACA TGT GGA AAC ACT CAA 7702 
Asn He Gly Glu Asn Thr Ser Leu He Asp Thr Cys Gly Asn Thr Gin 
465 470 475 

AAT GTT TCA GGG GCA AAT CCT GTA GAT TGT ACC ATG TAT GCA AAT AAA 7750 
Asn Val Ser Gly Ala Asn Pro Val Asp Cys Thr Met Tyr Ala Asn Lys 
480 485 490 

ATG TAC AAT TGT TCT TTA CAA AAC GGG TTT ACT ATG AAG GTA GAT GAC 7798 
Met Tyr Asn Cys Ser Leu Gin Asn Gly Phe Thr Met Lys Val Asp Asp 
495 500 505 510 

CTT ATT ATG CAT TTC AAT ATG ACA AAA GCT GTA GAA ATG TAT AAT ATT 7846 
Leu He Met His Phe Asn Met Thr Lys Ala Val Glu Met Tyr Asn He 
515 520 525 

GCT GGA AAT TGG TCT TGT ACA TCT GAC TTG CCA CCA ACA TGG GGG TAT 7894 
Ala Gly Asn Trp Ser Cys Thr Ser Asp Leu Pro Pro Thr Trp Gly Tyr 
530 535 540 

ATG AAT TGT AAC TGT ACA AAT AAT AGT AAT GAT AAT ACT AGA ATG GCA 7942 
Met Asn Cys Asn Cys Thr Asn Asn Ser Asn Asp Asn Thr Arg Met Ala 
545 550 555 

TGT CCT AAC AAT CAA GGC ATC TTA AGG AAT TGG TAT AAC CCA GTA GCA 7990 
Cys Pro Asn Asn Gin Gly He Leu Arg Asn Trp Tyr Asn Pro Val Ala 
560 565 570 

GGA TTA CGA CAA TCC TTG GAA AAG TAT CAA GTT GTA AAA CAA CCA GAT 8038 
Gly Leu Arg Gin Ser Leu Glu Lys Tyr Gin Val Val Lys Gin Pro Asp 
575 580 585 590 

TAC TTA GTG GTC CCA GGG GAA GTC ATG GAA TAT AAA ACT AGA AGG AAA 8086 
Tyr Leu Val Val Pro Gly Glu Val Met Glu Tyr Lys Thr Arg Arg Lys 
595 600 » 605 

AGG GCA GCT ATT CAT GTT ATG TTA GCT CTT GCA ACA GTA TTA TCT ATG 8134 
Arg Ala Ala He His Val Met Leu Ala Leu Ala Thr Val Leu Ser Met 
610 615 620 

GCC GGA GCA GGG ACG GGG GCT ACT GCT ATA GGG ATG GTA ACA CAA TAT /8182 
Ala Gly Ala Gly Thr Gly Ala Thr Ala He Gly Met Val Thr Gin Tyr 
625 630 635 

CAC CAA GTT CTA GCA ACC CAT CAA GAA GCT An GAA AAG GTG ACT GAA 8230 
His Gin Val Leu Ala Thr His Gin Glu Ala He Glu Lys Val Thr Glu 
640 645 650 



8278 
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655 660 D 



8326 



675 680 
690 byb 
705 710 
720 725 

770 //b 
785 790 

815 820 
835 

TCT GAA AAA GAG GAG GAA TGATGAAGTA TCTCAGACTT ATTTTATAAG 
Ser Glu Lys Glu Glu Glu 
850 

GGAGATGCTG TGCTGAGTTC TTCCCTTTGA GGAAGGTATG TCATATGAAT CCATTTCAAA 
TCAMTTAAA CTAATAMGT ATGTATTATA AGGTAAAAAG AAAAAAAGAC AAAGAAGAAG 



8374 

8422 

8470 

8518 

8566 

8614 

8662 

8710 

8758 

8806 

.8854 

8914 
8974 
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AAGAAGGAAG AAAGCCTTCA A6MTATGAT GACA6CTTTA GAAGATCGCT TTAGAAAGCT 90 

ATTTGGCACA AATTCTACAA CGGGAGACAG TACAGTGGAA TCTGACGATG MCCTCCTAA 90 

AAAAGAAAAA AGGGTGGACT GGGATGAGTA TTGGGACCCT GAAGAAATAG AAAGAATGCT 91 

TATGGACTAG TGACTGTTTA CGAACAAATG ATAAATGATG GAMCAGCTG AGCATGACTC 92 

ATAGTTAAAG CGCTAGCAGC TGCTTAACCG CAAAACCACA TCCTATGTAA AGCTTGCTGA 92 

TGACGTATAA TTTGCTCCAC TGTAAAAGTA TATAACCAGT GCTTTGTGAG ACTTCGGGGA 92 

GTCTCTCCGT TGAGGACTTT CGAGTTCTCC CTTGAGGCTC CCACAGATAC AATAAATATT 9: 

TGAGATTGAA CCCTGTCAAG TATCTGTGTA ATCTTTTTTA CCTGTGAGGT CTCGGAATCC 9' 

GGGCCGAGAA CTTCGCA 9/ 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Asn Gly Gin Gly Arg Asp Trp Lys Met Ala He Lys Arg Cys 
1 5 10 15 

Ser Asn Val Ala Val Gly Val Gly Gly Lys Ser Lys Lys Phe Gly Glu 
20 25 30 

Glv Asn Phe Arq Trp Ala He Arg Met Ala Asn Val Ser Thr Gly Arg 
35 40 45 

Glu Pro Gly Asp lie Pro Glu Thr Leu Asp Gin Leu Arg Leu Val He 
50 55 60 

Cys Asp Leu Gin Glu Arg Arg Glu Lys Phe Gly Ser Ser Lys Glu lie 
65 70 75 80 

Asp Met Ala He Val Thr Leu Lys Val Phe Ala Val Val Gly Leu Leu 
85 90 95 

Asn Met Thr Val Ser Thr Ala Ala Ala Ala Glu Asn Met Tyr Thr Gin 
100 105 HO 

Met Gly Leu Asp Thr Arg Pro Ser Met Arg Glu Ala Gly Gly Lys Glu 
115 120 125 

Glu Ser Pro Pro Gin Ala Ser Pro He Gin Thr Ala Asn Gly Ala Pro 
130 135 140 
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Gin Tyr Val Ala Leu Asp Pro Lys Met Val Ser lie Phe Met Glu Lys 
145 150 155 i0U 

Ala Arg Glu Gly Leu Gly Gly Glu Glu Val Gin Leu Trp Phe Thr Ala 
165 

Phe Ser Ala Asn Leu Thr Pro Thr Asp Met Ala Thr Leu lie Met Ala 
180 1Bb 

Ala Pro Gly Cys Ala Ala Asp Lys Glu He Leu Asp Glu Ser Leu Lys 
195 200 

Gin Leu Thr Ala Glu Tyr Asp Arg Thr His Pro Pro Asp Gly Pro Arg 

210 215 
Pro Leu Pro Tyr Phe Thr Ala Ala Glu He Met Gly He Gly Leu Thr 
225 230 

Gin Glu Gin Gin Ala Glu Ala Arg Phe Ala Pro Ala Arg Met Gin Cys 

245 cbU ^ JJ 

Arg Ala Trp Tyr Leu Glu Ala Leu Gly Lys Leu Ala Ala He Lys Ala 

260 265 
Lys ^er Pro Arg Ala Val Gin Leu Arg Gin Gly Ala Lys Glu Asp Tyr 
275 280 

Ser Ser Phe He Asp Arg Leu Phe Ala Gin He Asp Gin Glu Gin Asn 

290 295 ^ uu 

Thr Ala Glu Val Lys Leu Tyr Leu Lys Gin Ser Leu Ser Met Ala Asn 
305 . 310 

Ala Asn Ala Glu Cys Lys Lys Ala Met Ser His Leu Lys Pro Glu Ser 

325 

Thr Leu Glu Glu Lys Leu Arg Ala Cys Gin Glu Val Gly Ser Pro Gly 

340 345 
Tyr Lys Met Gin Leu Leu Ala Glu Ala Leu Thr Lys Val Gin Val Val 

355 360 J 

Gin Ser Lys Gly Ser Gly Pro Val Cys Phe Asn Cys Lys Lys Pro Gly 

370 375 JSU 

His Leu Ala Lys Gin Cys Arg Asp Val Lys Lys Cys Asn Lys Cys Gly 
385 390 

Lys Pro Gly His Leu Ala Ala Lys Cys Trp Gin Gly Gly Lys Lys & Asn 
405 



Ser Gly Asn Trp Lys Ala Gly Arg Ala Ala Ala Pro Val Asn Gin Val 

420 425 

Gin Gin Ala Val Met Pro Ser Ala Pro Pro Met Glu Glu Arg Leu Lei 
435 440 
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Asp Leu 
450 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Ala Glu Gly Phe Ala Ala Asn Arg Gin Trp He Gly Pro Glu Glu 
1 5 10 15 

Ala Glu Glu Leu Leu Asp Phe Asp lie Ala Thr Gin Met Asn Glu Glu 
20 25 30 

Glv Pro Leu Asn Pro Gly Met Asn Pro Phe Arg Val Pro Gly He Thr 
35 40 45 

Asp Lys Glu Lys Gin Asp Tyr'Cys Asn He Leu Gin Pro Lys Leu Gin 
50 55 60 

Asd Leu Arg Asn Glu Leu Gin Glu Val Lys Leu Glu Glu Gly Asn Ala 
65 70 75 80 

Gly Lys Phe Arg Arg Ala Arg Tyr Leu Arg Tyr Ser Asp Glu Asn Val 
85 90 y» 

Leu Ser He Val Tyr Leu Leu He Gly Tyr Leu Arg Tyr Leu He Asn 
100 105 HO 

Arg Arg Ser Leu Gly Ser Leu Arg His Asp He Asp He Glu Thr Pro 
115 120 125 

Gin Glu Glu Tyr Tyr Ser Asn Ser Glu Arg Gly Thr Thr Leu Asn Gin 
130 135 140 

Lys Tyr Ala Arg Arg Cys Cys Val Ser Thr LeU He Met Tyr Leu He 
145 150 155 160 

Leu Phe Ala Val Gly He Trp Trp Gly Ala Arg Ala Gin Val Val Trp 
165 170 175 

Arg Leu Pro Pro Leu Val Val Pro Val Glu Glu Ser Glu He He Phe 
180 185 190 

Trp Asp Cys Trp Ala Pro Glu Glu Pro Ala Cys Gin Asp Phe Leu Gly 

Ala Met He His Leu Lys Ala Ser Thr Asn He Ser He Gin Glu Gly 
210 215 220 
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Pro Thr Leu Gly Asn Trp Ala Arg Glu He Trp Gly Thr Leu Phe Lys 
225 230 235 £W 

Lys Ala Thr Arg Gin Cys Arg Arg Gly Arg He Trp Lys Arg Trp Asn 
J 245 250 

Glu Thr He Thr Gly Pro Leu Gly Cys Ala Asn Asn Thr Cys Tyr Asn 
260 265 <vu 

He Ser Val He Val Pro Asp Tyr Gin Cys Tyr Leu Asp Arg Val Asp 
27 5 280 28b 

Thr Trp Leu Gin Gly Lys Val Asn He Ser Leu Cys Leu Thr Gly Gly 
290 295 300 

Lys Met Leu Tyr Asn Lys Tyr Thr Lys Gin Leu Ser Tyr Cys Thr Asp 

305 310 

Pro Leu Gin lie Pro Leu He Asn Tyr Thr Phe Gly Pro Asn Gin Thr 
325 330 

Cys Met Trp Asn Thr Ser Gin lie Gin Asp Pro Glu He Pro Lys Cys 
340 345 JbU 

Gly Trp Trp Asn Gin Arg Ala Tyr Tyr Lys Asn Cys Lys Trp Glu Lys 
y 355 360 365 

Thr Asp Val Lys Phe His Cys Gin Arg Thr Gin Ser Gin Pro Gly Thr 

370 375 M ^ 

Trp Leu Arg Ala He Ser Ser Trp Arg Gin Arg Asn Arg Trp Glu Trp 
385 390 395 ™v 

Arg Pro Asp Phe Glu Ser Glu Lys Val Lys He Ser Leu Lys Cys Asn 
405 410 4ib 

Ser Thr Lys Asn Leu Thr Phe Ala Met Arg Ser Ser Gly Asp Tyr Gly 
420 ^25 4JU 

Glu Val Thr Gly Ala Trp He Glu Phe Gly Cys His Arg Asn Lys Ser 
435 440 44b 

Lys Leu His Asp Glu Ala Arg Phe Arg He Arg Cys Arg Trp Asn He 
450 455 460 

Gly Glu Asn Thr Ser Leu He Asp Thr Cys Gly Asn Thr Gin Asn Val 
465 470 475 

Ser Gly Ala Asn Pro Val Asp Cys Thr Met Tyr Ala Asn Lys Met Tyr 
485 490 4y; > 

Asn Cys Ser Leu Gin Asn Gly Phe Thr Met Lys Val Asp Asp Leu He 
500 505 aw 

Met His Phe Asn Met Thr Lys Ala Val Glu Met Tyr Asn He Ala Gly 
515 520 b " 
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Asn Trp Ser Cys Thr Ser Asp Leu Pro Pro Thr Trp Gly Tyr Met Asn 
530 535 540 

Cys Asn Cys Thr Asn Asn Ser Asn Asp Asn Thr Arg Met Ala Cys Pro 
545 550 555 560 

Asn Asn Gin Gly He Leu Arg Asn Trp Tyr Asn Pro Val Ala Gly Leu 
565 570 575 

Arq Gin Ser Leu Glu Lys Tyr Gin Val Val Lys Gin Pro Asp Tyr Leu 
580 585 590 

Val Val Pro Gly Glu Val Met Glu Tyr Lys Thr Arg Arg Lys Arg Ala 
595 600 605 

Ala He His Val Met Leu Ala Leu Ala Thr Val Leu Ser Met Ala Gly 
610 615 620 

Ala Gly Thr Gly Ala Thr Ala He Gly Met Val Thr Gin Tyr His Gin 
625 630 635 640 

Val Leu Ala Thr His Gin Glu Ala lie Glu Lys Val Thr Glu Ala Leu 
645 650 655 

Lvs lie Asn Asn Leu Arg Leu Val Thr Leu Glu His Gin Val Leu Val 
660 665 670 

He Gly Leu Lys Val Glu Ala Met Glu Lys Phe Leu Tyr Thr Ala Phe 
675 680 685 

Ala Met Gin Glu Leu Gly Cys Asn Gin Asn Gin Phe Phe Cys Lys Val 
690 695 700 

Pro Pro Glu Leu Trp Met Arg Tyr Asn Met Ser He Asn Gin Thr lie 
705 710 715 720 

Trp Asn His Gly Asn lie Thr Leu Gly GJu Trp Tyr Asn Gin Thr Lys 

Asp Leu Gin Gin Lys Phe Tyr Glu He He Met Asp lie Glu Gin Asn 
740 745 /bU 

Asn Val Gin Gly Lys Lys Gly He Gin Gin Le!i Gin Lys Trp Glu Asp 
755 760 765 

Trp Val Gly Trp He Gly Asn He Pro Gin Tyr Leu Lys Gly Leu Leu 
770 775 780 

Gly Gly He Leu Gly He Gly Leu Gly Val Leu Leu Leu He Leu Cys 
785 790 795 800 

Leu Pro Thr Leu Val Asp Cys He Arg Asn Cys He His Lys He Leu 
805 810 815 

Glv Tyr Thr Val He Ala Met Pro Glu Val Glu Gly Glu Glu lie Gin 
J J 820 825 830 
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Pro Gin Met 61 u Leu Arg Arg Asn Gly Arg Gin Cys Gly He Ser Glu 
835 840 »4b 

Lys Glu Glu Glu 
850 



(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
ATGGGGAACG GACAGGGGCG AGATTGGAAA ATGGCCATTA AGAGATGTAG TAATGTTGCT 60 

GTAGGAGTAG GGGGGAAGAG TAAAAAATTT GGAGAAGGGA ATTTCAGATG GGCCATTAGA 120 

ATGGCTAATG TATCTACAGG ACGAGAACCT GGTGATATAC CAGAGACTTT AGATCAACTA 180 

AGGTTGGTTA TTTGCGATTT ACAAGAAAGA AGAGAAAAAT TTGGGTCGAG CAAAGAAATT 240 

GACATGGCAA TTGTTACATT AAAAGTCTTT GCGGTAGTAG GACTTTTAAA TATGACAGTG 300 

TCTACTGCTG CTGCAGCTGA AAATATGTAC ACTCAGATGG GATTAGACAC TAGACCATCT 360 

ATGAGAGAAG CAGGAGGAAA AGAGGAAAGC CCTCCACAGG CATCTCCTAT TCAAACAGCA 420 

AATGGAGCAC CACAATATGT AGCACTTGAC CCAAAAATGG TGTCCATTTT TATGGAAAAG 480 

GCAAGAGAAG GATTAGGAGG TGAGGAAGTT CAGCTATGGT TTACTGCCTT CTCTGCAAAT 540 

TTAACACCTA CTGACATGGC CACATTAATA ATGGCCGCAC CAGGGTGCGC TGCAGATAAA 600 

GAAATATTGG ATGAAAGCTT AAAGCAATTG ACGGCAGAGT ATGATCGTAC CCATCCTCCT 660 
GATGGACCTA GACCATTACC CTATTTTACT GCAGCAGAAA TTATGGGTAT AGGATTAACT 720 
CAAGAACAAC AAGCAGAAGC AAGATTTGCA CCAGCTAGGA TGCAGTGTAG AGCATGGTAT 780 
CTCGAGGCAC TAGGAAAATT GGCCGCCATA AAAGCTAAGT CTCCTCGAGC TGTGCAGTTA 840 
AGACAAGGAG CTAAGGAAGA TTATTCATCC TTTATAGACA GATTGTTTGC CCAAATAGAT 900 
CAAGAACAAA ATACAGCTGA AGTTAAGTTA TATnAAAAC AGTCATTAAG CATGGCTAAT 960 

GCTAATGCAG AATGTAAAAA GGCAATGAGC CACCTTAAGC CAGAAAGTAC CCTAGAAGAA 1020 

AAGCTGAGAG CTTGTCAAGA AGTAGGCTCA CCAGGATATA AAATGCAACT CTTGGCAGAA 1080 
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6CTCTTACM AAGTTCAAGT AGTGCAATCA AAAGGATCAG GACCAGTGTG TTTCAACTGT 1140 

AAAAAACCAG GACATCTAGC AAAACAGTGT AGAGATGTGA AAAAATGTAA TAAATGTGGA 1200 

AAGCCTGGTC ATTTAGCTGC CAAATGCTGG CAAGGTGGTA AAAAGMTTC GGGAAACTGG 1260 

AAGGCGGGGC GAGCTGCAGC CCCAGTGAAT CAAGTGCAGC AAGCAGTAAT GCCATCTGCA 1320 

CCTCCAATGG AGGAGAGACT ATTGGATTTA 1350 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .405 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG GGG AAC GGA CAG GGG CGA GAT TGG AAA ATG GCC ATT AAG AGA TGT 48 
Met Gly Asn Gly Gin Gly Arg Asp Trp Lys Met Ala lie Lys Arg Cys 
855 860 865 

AGT AAT GTT GCT GTA GGA GTA GGG GGG AAG AGT AAA AAA TTT GGA GAA 96 
Ser Asn Val Ala Val Gly Val Gly Gly Lys Ser Lys Lys Phe Gly Glu 
870 875 880 

GGG AAT TTC AGA TGG GCC An AGA ATG GCT AAT GTA TCT ACA GGA CGA 144 
Gly Asn Phe Arg Trp Ala He Arg Met Ala Asn Val Ser Thr Gly Arg 
885 890 895 900 

GAA CCT GGT GAT ATA CCA GAG ACT TTA GAT CAA CTA AGG TTG GTT ATT 192 
Glu Pro Gly Asp He Pro Glu Thr Leu Asp Gin Leu Arg Leu Val He 
905 910 91b 

TGC GAT TTA CAA GAA AGA AGA GAA AAA TTT GGG TCG AGC AAA GAA ATT 240 
Cys Asp Leu Gin Glu Arg Arg Glu Lys Phe Gly Ser Ser Lys Glu He 
920 925 93U 

GAC ATG GCA ATT GTT ACA TTA AAA GTC TTT GCG GTA GTA GGA CTT TTA 288 
Asd Met Ala He Val Thr Leu Lys Val Phe Ala Val Val Gly Leu Leu 
935 940 945 

AAT ATG ACA GTG TCT ACT GCT GCT GCA GCT GAA AAT ATG TAC ACT CAG 336 
Asn Met Thr Val Ser Thr Ala Ala Ala Ala Glu Asn Met Tyr Thr Gin 
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955 960 



nrr 9/0 



965 

GAA AGC CCT CCA CAG GCA TCT 
Glu Ser Pro Pro Gin Ala Ser 
985 



(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 



Met Gly Asn Gly Gin Gly Arg Asp Trp Lys Met Ala He Lys Arg Cys 

I 5 iU 

Ser Asn Val Ala Val Gly Val Gly Gly Lys Ser Lys Lys Phe Gly Glu 



20 



Gly Asn Phe Arg Trp Ala He Arg Met Ala Asn Val Ser Thr Gly Arg 
35 

Glu Pro Gly Asp He Pro Glu Thr Leu Asp Gin Leu Arg Leu Val He 

50 55 
Cys Asp Leu Gin Glu Arg Arg Glu Lys Phe Gly Ser Ser Lys Glu He 
65 70 /b 

Asp Met Ala He Val Thr Leu Lys Val Phe Air Val Val Gly Leu Leu 
85 

ten Met Thr Val Ser Thr Ala Ala Ala Ala Glu Asn Met Tyr Thr Gin 



100 105 * 110 

Met Gly Leu Asp mr ma " U ^ l& 



Gly Leu Asp Thr Ar 9 Pro Ser Met Arg Glu Ala Gly Gly Lys Glu 
115 120 



Glu Ser Pro Pro Gin Ala Ser 
130 135 



(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 669 base pairs 



384 



405 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..669 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



48 



CCT ATT CAA ACA GCA AAT GGA GCA CCA CAA TAT GTA GCA CTT GAC CCA 
Pro He Gin Thr Ala Asn Gly Ala Pro Gin Tyr Val Ala Leu Asp Pro 
140 145 150 

AAA ATG GTG TCC ATT TTT ATG GAA AAG GCA AGA GAA GGA TTA GGA GGT 96 
Lvs Met Val Ser He Phe Met Glu Lys Ala Arg Glu Gly Leu Gly Gly 
155 160 165 

GAG GAA GTT CAG CTA TGG TTT ACT GCC TTC TCT GCA AAT TTA ACA CCT 144 
Glu Glu Val Gin Leu Trp Phe Thr Ala Phe Ser Ala Asn Leu Thr Pro 
170 175 180 

ACT GAC ATG GCC ACA TTA ATA ATG GCC GCA CCA GGG TGC GCT GCA GAT 192 
Thr Asd Met Ala Thr Leu He Met Ala Ala Pro Gly Cys Ala Ala Asp 
185 190 195 

AAA GAA ATA TTG GAT GAA AGC TTA AAG CAA TTG ACG GCA GAG TAT GAT 240 
Lys Glu He Leu Asp Glu Ser Leu Lys Gin Leu Thr Ala Glu Tyr Asp 
200 205 210 215 

CGT ACC CAT CCT CCT GAT GGA CCT AGA CCA TTA CCC TAT TTT ACT GCA 288 
Arg Thr His Pro Pro Asp Gly Pro Arg Pro Leu Pro Tyr Phe Thr Ala 
220 225 230 

GCA GAA ATT ATG GGT ATA GGA TTA ACT CAA GAA CAA CAA GCA GAA GCA 336 
Ala Glu He Met Gly He Gly Leu Thr Gin Glu Gin Gin Ala Glu Ala 
235 240 245 

AGA TTT GCA CCA GCT AGG ATG CAG TGT AGA GCA TGG TAT CTC GAG GCA 384 
Ara Phe Ala Pro Ala Arg Met Gin Cys Arg Ala Trp Tyr Leu Glu Ala 
y 250 255 260 

CTA GGA AAA TTG GCC GCC ATA AAA GCT AAG TCT CCT CGA GCT GTG CAG 432 
Leu Gly Lys Leu Ala Ala He Lys Ala Lys Ser Pro Arg Ala Val Gin 
265 270 275 

TTA AGA CAA GGA GCT AAG GAA GAT TAT TCA TCC TTT ATA GAC AGA TTG 
Leu Arg Gin Gly Ala Lys Glu Asp Tyr Ser Ser Phe He Asp Arg Leu 
280 285 290 

TTT GCC CAA ATA GAT CAA GAA CAA AAT ACA GCT GAA GTT AAG TTA TAT 
Phe Ala Gin lie Asp Gin Glu Gin Asn Thr Ala Glu Val Lys Leu Tyr 



480 



528 
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305 310 

tta aaa pap TfA TTA AGC ATG GCT AAT GCT AAT GCA GAA TGT AAA AAG 
12 t£ Gin Se? leu Ser Met All Asn Ala Asn Ala Glu Cys Lys Lys 
315 320 

rrA ATG AGC CAC CTT AAG CCA GAA AGT ACC CTA GAA GAA AAG CTG AGA 
Ala Met Ser His Lei Lys Pro Glu Ser Thr Leu Glu Glu Lys Leu Arg 
330 33 ^ 



rrr TGT CAA GAA GTA GGC TCA CCA GGA TAT AAA ATG CAA CTC TTG 
All SI GlS Glu Val Gly Ser Pro Gly Tyr Lys Met Gin Leu Leu 
345 350 J0 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Pro He Gin Thr Ala Asn Gly Ala Pro Gin Tyr Val Ala Leu Asj> Pro 

I 5 iU 

Lys Met Val Ser He Phe Met Glu Lys Ala Arg Glu Gly Leu Gly Gly 

J 20 25 ou 

Glu Glu Val Gin Leu Trp Phe Thr Ala Phe Ser Ala Asn Leu Thr Pro 

35 40 
Thr Asp Met Ala Thr Leu He Met Ala Ala Pro Gly Cys Ala Ala Asp 

50 55 
Lys Glu lie Leu Asp Glu Ser Leu Lys Gin Leu Thr Ala Glu Tyr-Asp 
65 70 

Arg Thr His Pro Pro Asp Gly Pro Arg Pro Leu Pro Tyr Phe Thr Ala 

85 yu 

Ala Glu lie Met Gly He Gly Leu Thr Gin Glu Gin Gin Ala Glu Ala 

100 105 
Arg Phe Ala Pro Ala Arg Met Gin Cys Arg Ala Trp Tyr Leu Glu Ala 

115 120 
Leu Gly Lys Leu Ala Ala He Lys Ala Lys Ser Pro Arg Ala Val Gin 

130 135 
Leu Arg Gin Gly Ala Lys Glu Asp Tyr Ser Ser Phe He Asp Arg Leu 
145 150 



576 



624 



669 
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Phe Ala Gin He Asp Gin Glu Gin Asn Thr Ala 61 u Val Lys Leu Tyr 
165 170 1/5 

Leu Lvs Gin Ser Leu Ser Met Ala Asn Ala Asn Ala Glu Cys Lys Lys 
180 185 190 

Ala Met Ser His Leu Lys Pro Glu Ser Thr Leu Glu Glu Lys Leu Arg 
195 200 205 

Ala Cys Gin Glu Val Gly Ser Pro Gly Tyr Lys Met Gin Leu Leu 
210 215 220 



(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(1x) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..42 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 

GCA GAG TAT GAT CGT ACC CAT CCT CCT GAT GGA CCT AGA CCA 42 
Ala Glu Tyr Asp Arg Thr His Pro Pro Asp Gly Pro Arg Pro 
225 230 235 



(2) INFORMATION FOR SEQ ID-NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ala Glu Tyr Asp Arg Thr His Pro Pro Asp Gly Pro Arg Pro 
1 5 10 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/ KEY: COS 

(B) LOCATION: 1..264 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 
15 20 



85 90 

ATG GAG GAG AGA CTA TTG GAT TTA 
Met Glu Glu Arg Leu Leu Asp Leu 
95 100 

(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
Thr Lys Val Gin Val Val Gin Ser Lys Gly Ser Gly Pro Val Cys Phe 

1 5 
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Asn Cys Lys Lys Pro Gly His Leu Ala Lys Gin Cys Arg Asp Val Lys 
20 25 30 

Lvs Cys Asn Lys Cys Gly Lys Pro Gly His Leu Ala Ala Lys Cys Trp 
35 40 45 

Gin Gly Gly Lys Lys Asn Ser Gly Asn Trp Lys Ala Gly Arg Ala Ala 
50 55 60 

Ala Pro Val Asn Gin Val Gin Gin Ala Val Met Pro Ser Ala Pro Pro 
65 70 75 80 

Met Glu Glu Arg Leu Leu Asp Leu 
85 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3841 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 

ATGATTGGAG TAGGAGGAGG AAAGAGAGGA ACAAATTATA TCAATGTGCA TTTAGAGATT 60 

AGAGATGAAA ATTATAAGAC ACAATGTATA TTTGGCAATG TTTGTGTCTT AGAAGATAAC 120 

TCATTAATAC AACCATTATT AGGGAGAGAT AATATGATTA GATTCAATAT TAGGTTAGTA 180 

ATGGCTCAAA TTTCTGACAA GATTCCAATA GTAAAAGTAA AAATGAAGGA TCCAAATAAA 240 

GGACCTCAAA TAAAACAATG GCCATTAACA AATGAAAAAA ' TTGAAGCTTT AACAGAAATA 300 

GTAGAAAGAC TAGAAAGAGA AGGGAAAGTA AAAAGAGCAG ATCCAAATAA CCCATGGAAT 360 

ACACCAGTAT TTGCAATAAA AAAGAAAAGT GGAAAATGGA GAATGCTCAT AGATTTTAGA 420 

GAATTGAACA AATTAACTGA GAAAGGGGCA GAAGTCCAGT TAGGACTCCC TCATCCTGCT 480 

GGATTAAAAA TGAAAAAACA AGTTACTGTG CTAGATATAG GAGATGCATA CTTCACTATT 540 

CCCTTGGATC CAGACTATGC TCCCTATACT GCATTCACAT TACCTAGAAA GAATAATGCA ' 600 

GGACCAGGGA GGAGATATGT ATGGTGCAGT TTACCACAGG GGTGGGTTCT AAGCCCATTG 660 

ATATATCAAA GTACTTTAGA TAATATAATA CAACCTTTTA TTAGACAAAA TCCTGAGTTA 720 

GATATTTATC AATATATGGA TGACATTTAT ATAGGATCAA ACTTAAGTAA AAAGGAGCAT 780 
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AAA6AAAAAG TAGAAGAATT AAGAAAATTG TTATTATGGT GGGGATTTGA AACCCCGGAA 840 

GACAAATTAC AAGAAGAGCC CCCATATAAG TGGATGGGCT ATGAATTACA TCCATTAACA 900 

TGGTCAATAC AGCAAAAACA ATTAGAAATT CCAGAAAGAC CCACATTAAA TGAACTGCAG 960 

AAATTAGCAG GTAAGATAM CTGGGCCAGT CAAACTATCC CAGACTTAAG TATAAAAGAA 1020 

CTAACTAACA TGATGAGAGG AGATCAGAAG TTAGACTCAA TAAGAGAATG GACTGTGGAA 1080 

GCCAAGAGAG AAGTACAAAA AGCTAAGGAA GCTATTGAGA TGCAAGCACA GCTAAATTAT 1140 

TATGATCCCC ACCGAGAATT ATATGCAAAA TTAAGTTTAG TGGGACCACA TCAMTATGT 1200 

TATCAAGTGT ATCATAAGAA CCCAGAATGT ATTTTATGGT ATGGTAAGAT GAATAGACAA 1260 

AAGAAAAAGG CAGAAAATAC CTGTGATATA GCTCTAAGGG CATGTTATAA AATAAGAGAA 1320 

GAATCTATTA TAAGMTAGG AAAAGAACCA ATATATGAAA TACCTACTTC TAGAGAAGCC 1380 

TGGGAGTCAA ATTTAATTAA TTCACCATAT CTTAAGGCCC CACCTCCTGA GGTAGAATAT 1440 

ATCCATGCTG CTGTGAATAT AAAAAGAGCA TTAAGTATGA TAAAAGATGT TCCAATACCA 1500 

GAAGCAGAAA CGTGGTATAT AGATGGAGGC AGAAAGCTAG GAAAAGCAGC AAAAGCAGCC 1560 

TATTGGACAG ATACAGGGAA GTGGCAAGTA ATGGAGTTAG AAGGCAGTAA TCAGAAGGCA 1620 

GMGTACAAG CATTATTATT GGCATTAAAA GCAGGATCAG AGGAAATGAA TATTATAACA 1680 

GATTCACAAT ATGTTATAAA TATTATTCTT CAACAACCAG ATATGATGGA GGGMTCTGG 1740 

CAAGAAGTTT TAGAAGAATT GGAGAAAAAA ACAGCAATAT TTATAGATTG GGTCCCAGGA 1800 
CATAAAGGTA TTCCAGGAAA TGAGGAAGTA GATAAGCTTT GTCAAACAAT GATGATAATA 1860 
GAAGGGGATG GGATATTAGA TAAAAGGTCA GAAGATGCGG GATATGATTT ATTGGCTGCA 1920 
AAAGAAATAC ATTTATTGCC AGGAGAGGTA AAAGTAATAC CAACAGGGGT AAAGCTAATG 1980 
CTGCCTAAAG GACATTGGGG ACTAATAATG GGAAGAAGCT CGATAGGGAG TAAAGGATTG 2040 
GATGTATTAG GAGGGGTAAT AGATGAAGGA TATCGAGGTG AAATTGGAGT AATAATGATT 2100 
AATGTATCAA GAAAATCAAT CACCTTAATG GAACAACAAA AGATAGCACA ATTAATAATA 2160 
TTGCCTTGTA AACATGAAGT ATTAGAACAA GGAAAAGTTG TAATGGATTC AGAGAGAGGA 2220 
GACAAAGGTT ATGGGTCAAC AGGAGTATTC TCCTCTTGGG TTGACAGGAT TGAGGAAGCA 2280 
GAAATAAATC ATGAAAAATT TCACTCAGAT CCACAATACT TAAGGACTGA ATTTAATTTA 2340 
CCCAAGATGG TTGCAGAAGA GATAAGACGA AAGTGCCCTG TATGTAGAAT CAGAGGAGAA 2400 
CAAGTGGGAG GACAATTGAA AATAGGGCCT GGAATATGGC AAGTGGATTG CACACACTTT 2460 
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AATAGTAAGA TAATCATTGT AGCAGTACAT GTGGAATCAG GATTTTTATG GGCACAGATA 2520 

ATTCCACAGG AGACTGCAGA TTGTACAGTC AAGGCTCTTC TGCAACTTAT ATGTGCTCAT 2580 

AATGTTACAG AATTACAAAC AGACAATGGA CCAAATTTTA AAAATCAGAA AATGGAAGGT 2640 

TTATTAAATT TTATGGGAAT AAAACATAAA TTAGGGATAC CAGGTAACCC ACAATCACAG 2700 

GCATTAGTGG AAAATGCTAA TAACACATTA AAAGCTTGGA TTCAAAAATT CCTACCAGAG 2760 

ACTACCTCTC TGGATAATGC TCTGGCCCTA GCCCTGTATA GTCTCAACTT TAAACAAAGG 2820 

GGTAGACTAG GAAGGATGGC CCCTTATGM TTATACATAC AACAAGAATC ATTAAGAATA 2880 

CAAGACTATT TTTCGCAGAT TCCACAAAAG TTAATGATGC AGTGGGTGTA TTACAAAGAT 2940 

CAAAAAGACA AAAAATGGAA GGGACCAATG AGAGTGGAAT ATTGGGGACA AGGATCAGTA 3000 

TTATTAAAGG ATGAAGAGAA GGGATATTTT CTTGTACCTA GGAGACACAT AAGAAGAGTC 3060 

CCAGAACCCT GCACTCTTCC TGAAGGGGAT GAGTGACGAA GATTGGCAGG TAAGTAGAAG 3120 

ACTCTTTGCA GTGCTCCAAG GAGGAGTACG TAGTGCTATG CTATACATAT CTAGACTACC 3180 

TCCGGACGAA AGAGAAAGGT ATAAAAAAGA CTTTAAGAAA AGGCTTTTGG AAAAGGAAAC 3240 

AGGATTCATA CAGAGATTAA GAAAAGCGGA AGGAATAAGG TGGAGCTTCC ATACTAGAGA 3300 

TTATTATATA GGATATGTAA GAGAGATGGT GGCCGGATCT AGTCTACCAG ATAGTTTAAG 3360 

ACTGTATATT TATATAAGCA ATCCATTGTG GCACTGGTCA TACCGTCCTG GCCTGACAAA 3420 

TTTTAATACA GAATGGCCTT TTGTGAATAT GTGGATAAAG ACAGGATTCA TGTGGGATGA 3480 

TATTGAAAGC CAGMTATTT GCAAAGGAGG AGAGATTTCA CATGGATGGG GACCTGGAAT 3540 

GGTGGGAATT GTGATAAAAG CTTTTAGTTG TGGAGAAAGA AAGATTGAGG CTACTCCTGT 3600 

AATGATTATA AGAGGAGAAA TAGATCCAAA AAAATGGTGT GGAGATTGTT GGAATTTGAT 3660 

GTGTCTTAGG AACTCACCTC CACAGACTTT ACAAAGACTT GCTATGTTGG CATGTGGCGT 3720 

GCCGGCTAAG GAGTGGCGAG GATGCTGTAA TCAACGCTTT GTTTCTCCTT ACAGAACGCC 3780 

TGCTGATTTG GAGGTCATTC AATCCAAGCC CAGCTGGAGT CTATTATGGT CAGGGAGCCT 3840 
A 



3841 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3093 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/ KEY: CDS . 

(B) LOCATION: 1..3093 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 

ATG ATT GGA GTA GGA GGA GGA AAG AGA GGA ACA AAT TAT ATC AAT GTG 48 
Met lie Gly Val Gly Gly Gly Lys Arg Gly Thr Asn Tyr He Asn Val 
90 95 iUU 

CAT TTA GAG ATT AGA GAT GAA AAT TAT AAG ACA CAA TGT ATA TTT GGC 96 
ml S Glu lie A?S Asp Glu Asn Tyr Lys Thr Gin Cys He Phe G y 
105 ' HO 115 

T rTr m fiM GAT MC TC A TTA ATA CAA CCA TTA TTA GGG 144 
Z £1 SI Val Si GlS Z Mn Ser Leu He Gin Pro Leu Leu 61, 

^t. rfiT aat ATP ATT AGA TTC AAT ATT AGG TTA GTA ATG GCT CAA ATT 192 
$ Z to! Set lie SS IS "g « Leu Val Met Ala Gin lie 



140 



240 



288 



TTA ACA GAA ATA GTA GAA AGA CTA GAA AGA GM GGG AAA GTA AAA AGA 336 
Leu Thr Glu He Val Glu Arg Leu Glu Arg Gib Gly Lys Val Lys Arg 



TCT GAC AAG An CCA ATA GTA AAA GTA AAA ATG AAG GAT CCA AAT AAA 
ler Lys lie Pro lie Val Lys Val Lys Met Lys Asp Pro Asn Lys 
155 160 15b 

rrA TCT CAA ATA AAA CAA TGG CCA TTA ACA AAT GAA AAA ATT GAA GCT 
Gly" Pro GlS He fys GlS Trp Pro Leu Thr Asn Glu Lys lie Glu Ala 
170 175 180 

GAA ATA GTA GAA AGA CTA GAA AGA 
Glu He Val Glu Arg Leu Glu Arg 
185 '"" 190 195 

GCA GAT CCA AAT AAC CCA TGG AAT ACA CCA GTA TTT GCA ATA AAA AAG 384 
Ala Asp Pro Asn Asn Pro Trp Asn Thr Pro Val Phe Ala l le Lys Lys 
205 21° 

AAA AGT GGA AAA TGG AGA ATG CTC ATA GAT TTT AGA GAA TTG AAC AAA 432 
Lys Ser Gly Lys Trp Arg Met Leu He Asp Phe Arg Glu Leu Asn Lys 
220 225 ^ JU 

22 £ US Si 8 Si Hi SS SS S SJ 2£ c £ 8K & ^ 480 
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235 



240 



245 



GGA TTA AAA ATG AAA AAA CAA GTT ACT GTG CTA GAT ATA GGA GAT GCA 528 
Glv Leu Lys Met Lys Lys Gin Val Thr Val Leu Asp He Gly Asp Ala 
250 255 260 

TAC TTC ACT ATT CCC TTG GAT CCA GAC TAT GCT CCC TAT ACT GCA TTC 576 
Tyr Phe Thr He Pro Leu Asp Pro Asp Tyr Ala Pro Tyr Thr Ala Phe 
265 270 275 280 

ACA TTA CCT AGA AAG AAT AAT GCA GGA CCA GGG AGG AGA TAT GTA TGG 624 
Thr Leu Pro Arg Lys Asn Asn Ala Gly Pro Gly Arg Arg Tyr Val Trp 
285 290 295 

TGC AGT TTA CCA CAG GGG TGG GTT CTA AGC CCA TTG ATA TAT CAA AGT 672 
Cvs Ser Leu Pro Gin Gly Trp Val Leu Ser Pro Leu He Tyr Gin Ser 
y 300 305 310 

ACT TTA GAT AAT ATA ATA CAA CCT TTT ATT AGA CAA AAT CCT GAG TTA 720 
Thr Leu Asp Asn He He Gin Pro Phe He Arg Gin Asn Pro Glu Leu 
315 320 325 

GAT ATT TAT CAA TAT ATG GAT GAC ATT TAT ATA GGA TCA AAC TTA AGT 768 
Asp He Tyr Gin Tyr Met Asp Asp lie Tyr He Gly Ser Asn Leu Ser 
330 335 340 

AAA AAG GAG CAT AAA GAA AAA GTA GAA GAA TTA AGA AAA TTG TTA TTA 816 
Lys Lys Glu His Lys Glu Lys Val Glu Glu Leu Arg Lys Leu Leu Leu 
345 350 355 360 

TGG TGG GGA TTT GAA ACC CCG GAA GAC AAA TTA CAA GAA GAG CCC CCA 864 
TrD TrD Gly Phe Glu Thr Pro Glu Asp Lys Leu Gin Glu Glu Pro Pro 
w 365 370 375 

TAT AAG TGG ATG GGC TAT GAA TTA CAT CCA TTA ACA TGG TCA ATA CAG 912 
Tyr Lys Trp Met Gly Tyr Glu Leu His Pro Leu Thr Trp Ser He Gin 
380 385 390 

CAA AAA CAA TTA GAA ATT CCA GAA AGA CCC ACA TTA AAT GAA CTG CAG 960 
Gin Lys Gin Leu Glu He Pro Glu Arg Pro Thr Leu Asn Glu Leu Gin 
395 400 405 

AAA TTA GCA GGT AAG ATA AAC TGG GCC AGT CAA ACT ATC CCA GAC TTA 1008 
Lys Leu Ala Gly Lys He Asn Trp Ala Ser Gin Thr He Pro Asp Leu 
410 415 420 

AGT ATA AAA GAA CTA ACT AAC ATG ATG AGA GGA GAT CAG AAG TTA GAC 1056 
Ser He Lys Glu Leu Thr Asn Met Met Arg Gly Asp Gin Lys Leu Asp 
425 430 435 440 

TCA ATA AGA GAA TGG ACT GTG GAA GCC AAG AGA GAA GTA CAA AAA GCT 1104 
Ser lie Arq Glu Trp Thr Val Glu Ala Lys Arg Glu Val Gin Lys Ala 
445 450 455 

AAG GAA GCT An GAG ATG CAA GCA CAG CTA AAT TAT TAT GAT CCC CAC 1152 
Lys Glu Ala. He Glu Met Gin Ala Gin Leu Asn Tyr Tyr Asp Pro His 
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460 465 470 

ai!!is^atJJffi£SS?"~S^®K 1200 

^ 475 480 wo 

TAT CM GTG TAT CAT AAG AAC CCA GAA TGT ATT TTA TGG TAT GGT AAG 1248 
}yl G^ Val TyV His Lys Asn Pro Glu Cys He Leu Trp Tyr Gly Lys 
J 490 495 

s ss « as s $ s Aia as ss I si ss I™ si g 1296 

505 510 

« GCA TGT TAT AAA ATA AGA GAA GAA TCT ATT ATA AGA ATA GGA AM 1344 
Arg Ala Cys Tyr Lys He Arg Glu Glu Ser He Ue Arg lie |iy ^ 

S2S 

ru rr , , TA TAT GAA ATA CCT ACT TCT AGA GAA GCC TGG GAG TCA AAT 1392 
Pri He t" Gli He Pro Thr Ser Arg Glu Ala Trp Glu Ser Asn 

540 545 DDV 

555 560 Da;j 

570 575 S0 

sssisssKgsgiiiiss^S""""* 1536 

585 590 

ffi ft $ SS § $ Ala S Tyr § Si SS £ S5 g ^ ^ 

as sf ss is s as ss ss § as « as g iK sa 1632 

620 625 

H5 SS SS SS BS $ Si § 5f IS ^ IS g SI ft «f 

635 640 

gssMasiKSissniiiiSgs^i^^S 

665 670 

Si E Si £ 55 S5 Si 8 - $ §5 - - * - - 1824 



1440 

1488 



1680 
1728 
1776 
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685 



690 



695 



GAA GTA GAT AAG CTT TGT CAA ACA ATG ATG ATA ATA GAA GGG GAT GGG 1872 
Glu Val Asp Lys Leu Cys Gin Thr Met Met He He Glu Gly Asp Gly 
700 705 710 

ATA TTA GAT AAA AGG TCA GAA GAT GCG GGA TAT GAT TTA TTG GCT GCA 1920 
He Leu Asp Lys Arg Ser Glu Asp Ala Gly Tyr Asp Leu Leu Ala Ala 
715 720 725 

AAA GAA ATA CAT TTA TTG CCA GGA GAG GTA AAA GTA ATA CCA ACA GGG 1968 
Lys Glu He His Leu Leu Pro Gly Glu Val Lys Val He Pro Thr Gly 
730 735 740 

GTA AAG CTA ATG CTG CCT AAA GGA CAT TGG GGA CTA ATA ATG GGA AGA 2016 
Val Lys Leu Met Leu Pro Lys Gly His Trp Gly Leu He Met Gly Arg 
745 750 755 760 

AGC TCG ATA GGG AGT AAA GGA TTG GAT GTA TTA GGA GGG GTA ATA GAT 2064 
Ser Ser He Gly Ser Lys Gly Leu Asp Val Leu Gly Gly Val He Asp 
765 770 775 

GAA GGA TAT CGA GGT GAA ATT GGA GTA ATA ATG An AAT GTA TCA AGA 2112 
Glu Gly Tyr Arg Gly Glu He Gly Val He Met He Asn Val Ser Arg 
780 785 790 

AAA TCA ATC ACC TTA ATG GAA CAA CAA AAG ATA GCA CAA TTA ATA ATA 2160 
Lvs Ser He Thr Leu Met Glu Gin Gin Lys He Ala Gin Leu lie He 
y 795 800 805 

TTG CCT TGT AAA CAT GAA GTA TTA GAA CAA GGA AAA GTT GTA ATG GAT 2208 
Leu Pro Cys Lys His Glu Val Leu Glu Gin Gly Lys Val Val Met Asp 
810 815 820 

TCA GAG AGA GGA GAC AAA GGT TAT GGG TCA ACA GGA GTA TTC TCC TCT 
Ser Glu Arg Gly Asp Lys Gly Tyr Gly Ser Thr Gly Val Phe Ser Ser 
825 830 835 840 

TGG GTT GAC AGG ATT GAG GAA GCA GAA ATA AAT CAT GAA AAA TTT CAC 
Tpn Val Asd Arg He Glu Glu Ala Glu He Asn His Glu Lys Phe His 
y 845 850 855 

TCA GAT CCA CAA TAC TTA AGG ACT GAA TTT AAf TTA CCC AAG ATG GTT 
Ser Asp Pro Gin Tyr Leu Arg Thr Glu Phe Ash Leu Pro Lys Met Val 
860 865 870 

GCA GAA GAG ATA AGA CGA AAG TGC CCT GTA TGT AGA ATC AGA GGA GAA 
Ala Glu Glu lie Arg Arg Lys Cys Pro Val Cys Arg He Arg Gly Glu 
875 880 885 

CAA GTG GGA GGA CAA TTG AAA ATA GGG CCT GGA ATA TGG CAA GTG GAT 
Gin Val Gly Gly Gin Leu Lys lie Gly Pro Gly He Trp Gin Val Asp 
890 895 900 

TGC ACA CAC TTT AAT AGT AAG ATA ATC ATT GTA GCA GTA CAT GTG GAA 
Cys Thr His Phe Asn Ser Lys He He He Val Ala Val His Val Glu 
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915 920 



905 910 

TGG GCA 
Trp Ala 

925 930 

AfA GTC AAG GCT CTT CTG CAA CTT ATA TGT GCT CAT AAT GTT ACA GAA 
Th? Val Lys Ala Leu Leu Gin Leu lie Cys Ala His Asn Val Thr Glu 



TTA GGA TTT TTA TGG GCA CAG ATA ATT CCA CAG GAG ACT GCA GAT TGT 2544 
Se? Gly SI Trp Ala Gin He lie Pro Gin Glu Thr Ala Asp Cys 



1020 



ACT CTT CCT GAA GGG GAT GAG 
Thr Leu Pro Glu Gly Asp Glu 
1115 



2592 



940 945 950 

tta TAA ACA GAC AAT GGA CCA AAT TTT AAA AAT CAG AAA ATG GAA GGT 2640 
™ GlS Thr /Sp Z Gl? Pro Asn Phe Lys Asn Gin Lys Met Glu Gly 
955 960 

TTA TTA AAT TTT ATG GGA ATA AAA CAT AAA nA GGG ATA CCA GGT AAC 2688 
™ US ten Phe Me? Gly He Lys His Lys Leu Gly He Pro Gly Asn 
970 975 y° u 

fTA CAA TCA CAG GCA TTA GTG GAA AAT GCT AAT AAC ACA TTA AAA GCT 2736 
Pro Gin ler Gin Ala Leu Val Glu Asn Ala Asn Asn Thr Leu Lys A a 
985 990 995 ^uu 

TPS ATT CAA AAA TTC CTA CCA GAG ACT ACC TCT CTG GAT AAT GCT CTG 2784 
Trp III Gl£ fyt Phe Leu Pro Glu Thr Thr Ser Leu Asp Asn Aia^Leu 
1005 

GCC CTA GCC CTG TAT AGT CTC AAC TTT AAA CAA AGG GGT AGA CTA GGA 2832 
Ala 25 Ala Leu Tyr Ser Leu Asn Phe Lys Gin Arg Gly Arg Leu Gly 



2880 



AfT ATG GCC CCT TAT GAA TTA TAC ATA CAA CAA GAA TCA TTA AGA ATA 
A G rg Met Ala Pro T^r Glu Leu Tyr He Gin Gin Glu er Leu Arg He 
1035 1040 1U4b 

CAA GAC TAT TTT TCG CAG ATT CCA CAA AAG nA ATG ATG CAG TGG GTG 
GlS Asp Tyr Phe Ser Gin He Pro Gin Lys Leu Met Met Gin Trp Val 
1050 1055 iUDU 

TAT TAC AAA GAT CAA AAA GAC AAA AAA TGG AAG GGA CCA ATG AGA GTG 
Ty'r f£ 1% Gin Lys Asp Lys Lys Trp Lys Gly Pro Met Arg Val Q 
1065 1070 lu 

GAA TAT TGG GGA CAA GGA TCA GTA TTA TTA AAG GAT GAA GAG AAG GGA 
Glu Tyr Trp Gly Gin Gly Ser Val Leu LeuLys Asp Glu Glu Lys^Gly 
1 OSS 

TAT TTT CTT GTA CCT AGG AGA CAC ATA AGA AGA GTC CCA GAA CCC TGC 3072 
t" 52 lS ™ Pro Arg Arg His lie Arg Arg Val Pro Glu Pro Cys 
J noo 1105 



2928 



2976 



3024 



3093 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1031 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met He Gly Val Gly Gly Gly Lys Arg Gly Thr Asn Tyr He Asn Val 
1 5 10 15 

His Leu Glu He Arg Asp Glu Asn Tyr Lys Thr Gin Cys He Phe Gly 
20 25 30 

Asn Val Cys Val Leu Glu Asp Asn Ser Leu He Gin Pro Leu Leu Gly 
35 40 45 

Arq Asd Asn Met He Arg Phe Asn He Arg Leu Val Met Ala Gin He 
50 55 60 

Ser Asp Lys He Pro He Val Lys Val Lys Met Lys Asp Pro Asn Lys 
65 70 75 80 

Glv Pro Gin He Lys Gin Trp Pro Leu Thr Asn Glu Lys He Glu Ala 
85 90 95 

Leu Thr Glu He Val Glu Arg Leu Glu Arg Glu Gly Lys Val Lys Arg 
100 105 no 

Ala Asp Pro Asn Asn Pro Trp Asn Thr Pro Val Phe Ala lie Lys Lys 
115 120 125 

Lys Ser Gly Lys Trp Arg Met Leu He Asp Phe Arg Glu Leu Asn Lys 
130 135 140 

Leu Thr Glu Lys Gly Ala Glu Val Gin Leu Gly Leu Pro His Pro Ala 
145 150 155 160 

Glv Leu Lys Met Lys Lys Gin Val Thr Val Leu Asp He Gly Asp Ala 
165 170 * 175 

Tyr Phe Thr He Pro Leu Asp Pro Asp Tyr Ala Pro Tyr Thr Ala Phe 
180 185 190 

Thr Leu Pro Arg Lys Asn Asn Ala Gly Pro Gly Arg Arg Tyr Val Trp 
195 200 205 

Cys Ser Leu Pro Gin Gly Trp Val Leu Ser Pro Leu He Tyr Gin Ser 
210 215 220 

Thr Leu Asp Asn He He Gin Pro Phe He Arg Gin Asn Pro Glu Leu 
225 230 235 240 
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Asp He Tyr Gin Tyr Met Asp Asp He Tyr He Gly Ser Asn Leu Ser 
r 245 "0 " 3 

Lys Lys Glu His Lys Glu Lys Val Glu Glu Leu Arg Lys Leu Leu Leu 
260 265 2/u 

Trn TrD Gly Phe Glu Thr Pro Glu Asp Lys Leu Gin Glu Glu Pro Pro 
1 p v 275 280 285 

Tvr Lys Trp Met Gly Tyr Glu Leu His Pro Leu Thr Trp Ser He Gin 
J 290 295 300 

Gin Lys Gin Leu Glu He Pro Glu Arg Pro Thr Leu Asn Glu Leu Gin 
305 310 315 

Lys Leu Ala Gly Lys He Asn Trp Ala Ser Gin Thr He Pro Asp Leu 
J 325 330 

Ser He Lys Glu Leu Thr Asn Met Met Arg Gly Asp Gin Lys Leu Asp 
340 345 JbU 

Ser He Arg Glu Trp Thr Val Glu Ala Lys Arg Glu Val Gin Lys Ala 
355 360 365 

Lys Glu Ala He Glu Met Gin Ala Gin Leu Asn Tyr Tyr Asp Pro His 

370 375 380 

Arg Glu Leu Tyr Ala Lys Leu Ser Leu Val Gly Pro His Gin He Cys 
385 390 395 

Tyr Gin Val Tyr His Lys Asn Pro Glu Cys lie Leu Trp Tyr Gly Lys 
J 405 410 4lt> 

Met Asn Arg Gin Lys Lys Lys Ala Glu Asn Thr Cys Asp lie Ala Leu 

420 425 

Arg Ala Cys Tyr Lys He Arg Glu Glu Ser He He Arg He Gly Lys 
435 440 44b 

Glu Pro He Tyr Glu He Pro Thr Ser Arg Glu Ala Trp Glu Ser Asn 
450 455 460 

Leu lie Asn Ser Pro Tyr Leu Lys Ala Pro Pro Pro Glu Val Glu Tyr 
465 470 4 ?5 4HU 

He His Ala Ala Val Asn lie Lys Arg Ala Leu Ser Met lie Lys Asp 
485 490 

Val Pro He Pro Glu Ala Glu Thr Trp Tyr He Asp Gly Gly Arg Lys 
500 505 3iu 

Leu Gly Lys Ala Ala Lys Ala Ala Tyr Trp Thr Asp Thr Gly Lys Trp 
515 520 

Gin Val Met Glu Leu Glu Gly Ser Asn Gin Lys Ala Glu Val Gin Ala 
530 535 ^ 4U 
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Leu Leu Leu Ala Leu Lys Ala Gly Ser Glu Glu Met Asn He He Thr 
545 550 555 560 

Asp Ser Gin Tyr Val He Asn He He Leu Gin Gin Pro Asp Met Met 
565 570 575 

Glu Gly He Trp Gin Glu Val Leu Glu Glu Leu Glu Lys Lys Thr Ala 
580 . 585 590 

lie Phe He Asp Trp Val Pro Gly His Lys Gly He Pro Gly Asn Glu 
595 600 605 

Glu Val Asp Lys Leu Cys Gin Thr Met Met He lie Glu Gly Asp Gly 
610 615 620 

He Leu Asp Lys Arg Ser Glu Asp Ala Gly Tyr Asp Leu Leu Ala Ala 
625 630 635 640 

Lys Glu lie His Leu Leu Pro Gly Glu Val Lys Val He Pro Thr Gly 
645 650 655 

Val Lys Leu Met Leu Pro Lys Gly His Trp Gly Leu He Met Gly Arg 
660 665 670 

Ser Ser lie Gly Ser Lys Gly Leu Asp Val Leu Gly Gly Val lie Asp 
675 680 685 

Glu Gly Tyr Arg Gly Glu He Gly Val He Met He Asn Val Ser Arg 
690 695 700 

Lys Ser He Thr Leu Met Glu Gin Gin Lys He Ala Gin Leu He lie 
705 710 715 720 

Leu Pro Cys Lys His Glu Val Leu Glu Gin Gly Lys Val Val Met Asp 
725 730 735 

Ser Glu Arg Gly Asp Lys Gly Tyr Gly Ser Thr Gly Val Phe Ser Ser 
740 745 750 

TrD Val Asp Arg He Glu Glu Ala Glu He Asn His Glu Lys Phe His 
y 755 760 765 

Ser Asp Pro Gin Tyr Leu Arg Thr Glu Phe AsYi Leu Pro Lys Met Val 
770 775 ' 780 

Ala Glu Glu He Arg Arg Lys Cys Pro Val Cys Arg He Arg Gly Glu 
785 790 795 800 

Gin Val Gly Gly Gin Leu Lys He Gly Pro Gly He Trp Gin Val Asp 
805 810 815 

Cys Thr His Phe Asn Ser Lys He He He Val Ala Val His Val Glu 
820 825 830 

Ser Gly Phe Leu Trp Ala Gin He lie Pro Gin Glu Thr Ala Asp Cys 
835 840 845 
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Thr Val Lys Ala Leu Leu Gin Leu He Cys Ala His Asn Val Thr Glu 
850 855 860 

Leu Gin Thr Asp Asn Gly Pro Asn Phe Lys Asn Gin Lys Met Glu Gly 
865 870 875 880 

Leu Leu Asn Phe Met Gly He Lys His Lys Leu Gly He Pro Gly Asn 
885 890 89b 

Pro Gin Ser Gin Ala Leu Val Glu Asn Ala Asn Asn Thr Leu Lys Ala 
900 905" yi(J 

TrD He Gin Lys Phe Leu Pro Glu Thr Thr Ser Leu Asp Asn Ala Leu 
915 920 925 

Ala Leu Ala Leu Tyr Ser Leu Asn Phe Lys Gin Arg Gly Arg Leu Gly 
930 935 940 

Arg Met Ala Pro Tyr Glu Leu Tyr He Gin Gin Glu Ser Leu Arg He 
945 950 955 960 

Gin Asp Tyr Phe Ser Gin He Pro Gin Lys Leu Met Met Gin Trp Val 
Tyr Tyr Lys Asp Gin Lys Asp Lys Lys Trp Lys Gly Pro Met Arg Val 

980 985 

Glu Tyr Trp Gly Gin Gly Ser Val Leu Leu Lys Asp Glu Glu Lys Gly 
995 1000 luUb 

Tyr Phe Leu Val Pro Arg Arg His He Arg Arg Val Pro Glu Pro Cys 
3 1010 1015 1020 

Thr Leu Pro Glu Gly Asp Glu 
1025 1030 

(2) INFORMATION FOR SEQ ID N0:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 753 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1. .753 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16: 

ATG ATT GAC GAA GAT TGG CAG GTA AGT AGA AGA CTC TTT GCA GTG CTC 
Met lie Asp Glu Asp Trp Gin Val Ser Arg Arg Leu Phe Ala Val Leu 
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1035 



1040 



1045 



CAA GGA GGA GTA CGT AGT GCT ATG CTA TAC ATA TCT AGA CTA CCT CCG 96 
Gin Gly Gly Val Arg Ser Ala Met Leu Tyr He Ser Arg Leu Pro Pro 
1050 1055 1060 

GAC GAA AGA GAA AGG TAT AAA AAA GAC TTT AAG AAA AGG CTT- TTG GAA 144 
Asp Glu Arg Glu Arg Tyr Lys Lys Asp Phe Lys Lys Arg Leu Leu Glu 
1065 1070 1075 

AAG GAA ACA GGA TTC ATA CAG AGA TTA AGA AAA GCG GAA GGA ATA AGG 192 
Lys Glu Thr Gly Phe He Gin Arg Leu Arg Lys Ala Glu Gly He Arg 
1080 1085 1090 1095 

TGG AGC TTC CAT ACT AGA GAT TAT TAT ATA GGA TAT GTA AGA GAG ATG 240 
Trp Ser Phe His Thr Arg Asp Tyr Tyr lie Gly Tyr Val Arg Glu Met 
1100 1105 1110 

GTG GCC GGA TCT AGT CTA CCA GAT AGT TTA AGA CTG TAT ATT TAT ATA 288 
Val Ala Gly Ser Ser Leu Pro Asp Ser Leu Arg Leu Tyr He Tyr He 
1115 1120 1125 

AGC AAT CCA TTG TGG CAC TGG TCA TAC CGT CCT GGC CTG ACA AAT TTT 336 
Ser Asn Pro Leu Trp His Trp Ser Tyr Arg Pro Gly Leu Thr Asn Phe 
1130 1135 1140 

AAT ACA GAA TGG CCT TTT GTG AAT ATG TGG ATA AAG ACA GGA TTC ATG 384 
Asn Thr Glu Trp Pro Phe Val Asn Met Trp He Lys Thr Gly Phe Met 
1145 1150 1155 

TGG GAT GAT ATT GAA AGC CAG AAT An TGC AAA GGA GGA GAG ATT TCA 432 
Trp Asp Asp He Glu Ser Gin Asn He Cys Lys Gly Gly Glu He Ser 
1160 1165 1170 1175 

CAT GGA TGG GGA CCT GGA ATG GTG GGA ATT GTG ATA AAA GCT TTT AGT 480 
His Gly Trp Gly Pro Gly Met Val Gly He Val He Lys Ala Phe Ser 
1180 1185 1190 

TGT GGA GAA AGA AAG ATT GAG GCT ACT CCT GTA ATG ATT ATA AGA GGA 528 
Cys Gly Glu Arg Lys He Glu Ala Thr Pro Val Met lie He Arg Gly 
1195 1200 1205 

GAA ATA GAT CCA AAA AAA TGG TGT GGA GAT TGT TGG AAT TTG ATG TGT 576 
Glu He Asp Pro Lys Lys Trp Cys Gly Asp Cys Trp Asn Leu Met Cys 
1210 1215 1220 

CTT AGG AAC TCA CCT CCA CAG ACT TTA CAA AGA CTT GCT ATG TTG GCA 624 
Leu Arg Asn Ser Pro Pro Gin Thr Leu Gin Arg Leu Ala Met Leu Ala 
1225 1230 1235 

TGT GGC GTG CCG GCT AAG GAG TGG CGA GGA TGC TGT AAT CAA CGC TTT 672 
Cys Gly Val Pro Ala Lys Glu Trp Arg Gly Cys Cys Asn Gin Arg Phe 
1240 1245 1250 1255 

GTT TCT CCT TAC AGA ACG CCT GCT GAT TTG GAG GTC ATT CAA TCC AAG 720 
Val Ser Pro Tyr Arg Thr Pro Ala Asp Leu Glu Val He Gin Ser Lys 
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1260 1265 1270 

CCC AGC TGG AGT CTA TTA TGG TCA GGG AGC CTA 753 
Pro Ser Trp Ser Leu Leu Trp Ser Gly Ser Leu 
1275 1280 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met He Asp Glu Asp Trp Gin Val Ser Arg Arg Leu Phe Ala Val Leu 
1 5 10 15 

Gin Gly Gly Val Arg Ser Ala Met Leu Tyr He Ser Arg Leu Pro Pro 
20 25 30 

Asp Glu Arg Glu Arg Tyr Lys Lys Asp Phe Lys Lys Arg Leu Leu Glu 
.35 40 45 

Lys Glu Thr Gly Phe He Gin Arg Leu Arg Lys Ala Glu Gly He Arg 
50 55 60 

Trp Ser Phe His Thr Arg Asp Tyr Tyr He Gly Tyr Val Arg Glu Met 
65 70 75 80 

Val Ala Gly Ser Ser Leu Pro Asp Ser Leu Arg Leu Tyr He Tyr lie 
85 90 95 

Ser Asn Pro Leu Trp His Trp Ser Tyr Arg Pro Gly Leu Thr Asn Phe 
100 1° 5 11U 

Asn Thr Glu Trp Pro Phe Val Asn Met Trp He Lys Thr Gly Phe Met 
XI 5 120 

Trp Asp Asp He Glu Ser Gin Asn He Cys ly} Gly Gly Glu He Ser 
130 135 1^0 

His Gly Trp Gly Pro Gly Met Val Gly He Val He Lys Ala Phe Ser 
145 150 155 160 

Cys Gly Glu Arg Lys He Glu Ala Thr Pro Val Met He lie Arg Gly 
165 1/3 

Glu He Asp Pro Lys Lys Trp Cys Gly Asp Cys Trp Asn Leu Met Cys 
180 185 J-W 

Leu Arg Asn Ser Pro Pro Gin Thr Leu Gin Arg Leu Ala Met Leu Ala - 
195 200 ^05 
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Cys Gly Val Pro Ala Lys 61 u Trp Arg Gly Cys Cys Asn Gin Arg Phe 
210 215 220 

Val Ser Pro Tyr Arg Thr Pro Ala Asp Leu Glu Val He Gin Ser Lys 
225 230 235 240 

Pro Ser Trp Ser Leu Leu Trp Ser Gly Ser Leu 
245 250 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2556 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
ATGGCAGAAG GATTTGCAGC CAATAGACAA TGGATAGGAC CAGAAGAAGC TGAAGAGTTA 60 

TTAGATTTTG ATATAGCAAC ACAAATGAAT GAAGAAGGGC CACTAAATCC AGGGATGAAC 120 

CCATTTAGGG TACCTGGAAT AACAGATAAA GAAAAGCAAG ACTATTGTAA CATATTACAA 180 

CCTAAGTTAC AAGATTTACG GAATGAACTT CAAGAGGTAA AACTAGAAGA AGGAAATGCA 240 

GGTAAGTTTA GAAGGGCAAG ATATTTAAGA TATTCTGATG AAAATGTGCT ATCTATAGTC 300 

TATTTGCTAA TAGGATATCT AAGATATTTA ATAAATCGTA GGAGTTTAGG ATCTTTAAGA 360 

CATGATATAG ACATAGAAAC ACCTCAAGAG GAATATTATA GTAATAGTGA AAGGGGTACC 420 

ACATTAAATC AAAAATATGC GAGAAGATGT TGTGTTAGCA CACTTATTAT GTATTTAATT 480 

CTTTTTGCAG TAGGCATCTG GTGGGGAGCT AGAGCACAAG TAGTGTGGAG ACTTCCCCCT 540 

TTAGTAGTTC CAGTAGAAGA ATCAGAAATA ATTTTTTGGG ATTGTTGGGC ACCAGAAGAA 600 

CCCGCCTGTC AAGACTTTCT TGGGGCAATG ATACATCTAA AAGCTAGTAC GAATATAAGT 660 

ATACAAGAGG GACCTACCTT GGGGAATTGG GCTAGAGAAA TATGGGGAAC ATTATTCAAA 720 

AAGGCTACCA GACAATGTAG AAGAGGTAGA ATATGGAAAA GATGGAATGA AACTATAACA 780 
GGACCATTAG GATGTGCTAA TAACACATGT TATAATATTT CAGTAATAGT ACCTGATTAT 840 
CAATGTTATC TAGACCGAGT AGATACTTGG TTACAAGGGA AAGTAAATAT ATCATTATGT 900 
CTAACAGGAG GAAAAATGTT GTACAATAAA TATACAAAAC AATTAAGCTA TTGTACAGAC 960 
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CCATTACAAA TCCCACTGAT CAATTATACA TTTGGACCTA ATCAAACATG TATGTGGAAC 1020 

ACTTCACAAA TTCAGGACCC TGAGATACCA AAATGTGGAT GGTGGAATCA AAGAGCCTAT 1080 

TATAAAAATT GTAAATGGGA AAAAACAGAT GTAAAGTTTC ATTGTCAAAG AACACAGAGT 1140 

CAGCCTGGAA CATGGCTTAG AGCAATCTCG TCATGGAGAC AAAGGAATAG ATGGGAATGG 1200 

AGACCAGATT TTGAAAGTGA AAAGGTGAAA ATATCTCTAA AGTGTAATAG CACAAAAAAC 1260 

CTAACCTTTG CAATGAGAAG TTCAGGAGAT TATGGAGAAG TAACGGGAGC TTGGATAGAG 1320 

TTTGGATGTC ATAGAAATAA ATCAAAACTT CATGATGAAG CAAGGTTTAG AATTAGATGT 1380 

AGATGGAATA TAGGGGAGAA TACCTCACTC ATTGATACAT GTGGAAACAC TCAAAATGTT 1440 

TCAGGGGCAA ATCCTGTAGA TTGTACCATG TATGCAAATA AAATGTACAA TTGTTCTTTA 1500 

CAAAACGGGT TTACTATGAA GGTAGATGAC CTTATTATGC ATTTCAATAT GACAAAAGCT 1560 

GTAGAAATGT ATAATATTGC TGGAAATTGG TCTTGTACAT CTGACTTGCC ACCAACATGG 1620 

GGGTATATGA ATTGTAACTG TACAAATAAT AGTAATGATA ATACTAGAAT GGCATGTCCT 1680 

AACAATCAAG GCATCTTAAG GAATTGGTAT AACCCAGTAG CAGGATTACG ACAATCCTTG 1740 

GAAAAGTATC AAGTTGTAAA ACAACCAGAT TACTTAGTGG TCCCAGGGGA AGTCATGGAA 1800 

TATAAAACTA GAAGGAAAAG GGCAGCTATT CATGTTATGT TAGCTCTTGC AACAGTATTA 1860 

TCTATGGCCG GAGCAGGGAC GGGGGCTACT GCTATAGGGA TGGTAACACA ATATCACCAA 1920 

GTTCTAGCAA CCCATCAAGA AGCTATTGAA AAGGTGACTG AAGCCTTAAA GATAAACAAC 1980 

TTGAGATTAG TTACATTAGA GCATCAAGTA CTAGTAATAG GATTAAAAGT AGAAGCTATG 2040 

GAAAAATTTT TATATACAGC TTTCGCTATG CAAGAATTAG GATGTAATCA AAATCAATTC 2100 

TTCTGCAAAG TCCCTCCTGA ATTGTGGATG AGGTATAATA TGTCTATAAA TCAAACAATA 2160 

TGGAATCATG GAAATATAAC TTTGGGGGAA TGGTATAACC AAACAAAAGA TTTACAACAA 2220 

AAGTTTTATG AAATAATAAT GGACATAGAA CAAAATAATG TACAAGGGAA AAAAGGGATA 2280 

CAACAAnAC AAAAGTGGGA AGATTGGGTA GGATGGATAG GAAATATTCC ACAATACTTA 2340 

AAGGGACTAT TGGGAGGTAT CTTGGGAATA GGATTAGGAG TGTTATTATT AATTTTATGT 2400 

TTACCCACAT TGGTTGATTG TATAAGAAAT TGTATCCACA AGATACTAGG ATACACAGTA 2460 

ATTGCAATGC CTGAAGTAGA AGGAGAAGAA ATACAACCAC AAATGGAATT GAGGAGAAAT 2520 

GGTAGGCAAT GTGGCATATC TGAAAMGAG GAGGAA 2556 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..36 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CAA GAA TTA GGA TGT AAT CAA AAT CAA TTC TTC TGC 36 
Gin Glu Leu Gly Cys Asn Gin Asn Gin Phe Phe Cys 
255 260 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Gin Glu Leu Gly Cys Asn Gin Asn Gin Phe Phe Cys 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GGATGAGTAT TGGAACCCTG AA 
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(2) INFORMATION FOR SEQ ID N0:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GATTCCGAGA CCTCACAGGT AA 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
AATAGGGAAG CAGTAGCAGA C 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:24: 
GTAAATCGCA AATAACCAAC C 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:25: 
TGACGGTGTC TACTGCTGCT 20 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: . SEQ ID NO: 26: 
CACACTGGTC CTGATCCTTT T 21 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CCACAATATG TAGCACTTGA CC 22 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:28 
GGGTACTTTC TGGCTTAAGG TG 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:29: 
GGGGGACCTA CCTTGGGGAA TTGGGCT 



(2) INFORMATION FOR SEQ ID NO:30: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:30: 
GGTGATCATG ATCAGTGGGA TTTGTAATGG GTCTG 



(2) INFORMATION FOR SEQ ID N0:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:31: 
GGTGATCATG ATCAGTGGGA TTTGTAATGG GTCTG 
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(2) INFORMATION FOR SEQ ID N0:32: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:32: 
ATAAGGGAGA TACTGTGCTG A 



(2) INFORMATION FOR SEQ ID NO: 33: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:33: 
GCGATCTTCT AACTCTGTCA T 
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THAT WHICH IS CLAIMED IS: 

1. An isolated feline iimiunodeficiency virus (FIV) having all of 
the identifying characteristics of FIV clone JSY3. 

2. An isolated feline immunodeficiency virus (FIV) whose 
proviral DNA comprises a DNA sequence selected from the group 
consisting of SEQ ID N0:1 and sequences which vary from SEQ ID N0:1 
due to the degeneracy of the genetic code. 

3. A biologically pure culture of host cells containing the 
feline immunodeficiency virus of claim 1. 

4. A biologically pure culture of host cells containing the 
feline immunodeficiency virus of claim 2. 

'5. Isolated DNA comprising a DNA sequence selected from the 
group consisting of SEQ ID N0:1 and sequences which vary from SEQ ID 
N0:1 due to the degeneracy of the genetic code. 

6. A vector comprising DNA of claim 5. 

7. A vector according to claim 6. wherein said vector comprises 
bacteriophage lambda. 

8. A host cell containing and capable of expressing a vector 
according to claim 6. 

9. A host cell according to * claim 8, wherein said host cell 
comprises Escherichia coli. 

10. A host cell according to claim 8. wherein said host cell 
comprises a yeast cell . 



11. A host cell according to claim 8. wherein said host cell 
comprises a mammalian host cell. 
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12. Isolated DNA comprising a DNA sequence selected from the 
group consisting of: 

(a) SEQ ID NO:4. SEQ ID N0:5. SEQ ID N0:7. SEQ ID N0:9, 
SEQ ID N0:11. SEQ ID NO: 13. SEQ ID N0:14. SEQ ID NO:16. SEQ ID 
NO: 18. SEQ ID NO: 19; and 

(b) sequences which vary from those of (a) above due to the 
degeneracy of the genetic code. 

13. A vector comprising DNA of claim 12. 

14. A vector according to claim 13. wherein said vector 
comprises bacteriophage lambda. 

15. A host cell containing and capable of expressing a vector 
according to claim 13. 

16. A host cell according to claim 15. wherein said host cell 
comprises Escherichia coli. 

17. A host cell according to claim 15. wherein said host cell 
comprises a yeast cell . 

18. A host cell according to claim 15. wherein said host cell 
comprises a mammalian host cell. 

19. A polypeptide having a sequence selected from the group 
consisting of SEQ ID N0:2. SEQ ID N0:3. SEQ ID N0:6. SEQ ID N0:8. SEQ 
ID NO:10. SEQ ID N0:12. SEQ ID N0:15. SEO ID N0:17. and SEQ ID N0:20. 

20. A specific pathogen free (SPF) cat infected with feline 
immunodeficiency virus clone JSY3. 

21. A colony of SPF cats according to claim 20. 
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FIGURE 1 

460 470 480 



430 440 450 



-> JSY3 

490 500 , 5X0 , 520 ,530 ,540 

IK «S W SSS = 

550 560 570 ^ 530 , 590 , 600 

610 620 , 630 , 640 , 650 , 660 

= = s s= sra 

670 , 680 , 690 , 700 , 7!0 ,720 

730 , 740 , 750 , 760 , 770 , 730 
790 800 , 8!0 , 820 , 830 , 840 

sffi ssss " " — 

850 860 , 870 , 880 , 890 , 900 

™ ssss see = = " 



SUBSTITUTE SHEET (RULE 26) 



WO 98/39451 



PCT/US98/04147 



2/44 

Figure 1, continued 

910 920 930 940 950 960 

** * * * * * * ** * * 

ACAGTGAGTA TCTCTA6TGA AGCAGACTCG AGCTCATAAT CAAGTCACTG TTTAAAGGCC 
TGTCACTCAT AGAGATCACT TCGTCTGAGC TCGAGTATTA GTTCAGTGAC AAATTTCCGG 

970 980 990 1000 1010 1020 

** ** * * ** * * ** 

CAGATAAATT ACATCTGGTG ACTCTTCGCG GACCTTCAAG CCAGGAGATT CGCCGAGGGA 
GTCTATTTAA TGTAGACCAC TGAGAAGCGC CTGGAAGTTC GGTCCTCTAA GCGGCTCCCT 

1030 1040 1050 1060 1070 1080 

** * * ** * * * * * * 

CAGTCAACAA GGTAGGAGAG ATTCTGCAGC AACATGGGGA ACGGACAGGG GCGAGATTGG 
GTCAGTTGTT CCATCCTCTC TAAGACGTCG TTGTACCCCT TGCCTGTCCC CGCTCTAACC 

MG NGQG RDW> 
GAG -> 

1090 1100 1110 1120 1130 1140 

* * * * ★* * * ★* ** 

AAAATGGCCA TTAAGAGATG TAGTAATGTT GCTGTAGGAG TAGGGGGGAA GAGT AAAAAA 
TTTTACCGGT AATTCTCTAC ATCATTACAA CGACATCCTC ATCCCCCCTT CTCATTTTTT 

K MA IKRC SNV A V G VGGK SKK> 

1150 1160 1170 1180 1190 1200 

* * * * ** ** * * * * 

TTTGGAGAAG GGAATTTCAG ATGGGCCATT AGMTGGCTA ATGTATCTAC AGGACGAGAA 
AAACCTCTTC CCTTAAAGTC TACCCGGTAA TCTTACCGAT TACATAGATG TCCTGCTCTT 
FGE GNFR W A I RMA NVST GRE> 

1210 1220 1230 1240 1250 1260 

* * * * ** ** ** ** 

CCTGGTGATA TACCAGAGAC TTTAGATCAA CTAAGGTTGG TTATTTGCGA TTTACAAGAA 
GGACCACTAT ATGGTCTCTG AAATCTAGTT GATTCCAACC AATAAACGCT AAATGTTCTT 
PGD IPET LDQ LRL VICD LQE> 

1270 1280 1290 1300 1310 1320 

* * ** ** ** ** * * 

AGAAGAGAAA AATTTGGGTC GAGCAMGAA ATTGACATGG CAATTGTTAC ATTAAAAGTC 
TCTTCTCTTT TTAAACCCAG CTCGTTTCTT TAACTGTACC GTTAACAATG TAATTTTCAG 
RRE KFGS SKE IDM AIVT LKV> 

1330 1340 1350 1360 1370 1380 

* * ** * * ** ** ** 

TTTGCGGTAG TAGGACTTTT AAATATGACA GTGTCTACTG CTGCTGCAGC TGAAAATATG 
AAACGCCATC ATCCTGAAAA TTTATACTGT CACAGATGAC GACGACGTCG ACTTTTATAC 
FAV VGLL NMT VST A A A A ENM> 
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Figure 1, continued 

1390 1400 1410 1420 1430 1440 

* * ★ * * * ** ** ** 

TACACTCAGA TGGGATTAGA CACTAGACCA TCTATGAGAG AAGCAGGAGG AAAAGAGGAA 
ATGTGAGTCT ACCCTAATCT GTGATCTGGT AGATACTCTC TTCGTCCTCC TTTTCTCCTT 
YTQ MGLD TRP SMR EAGG KEE> 

1450 1460 1470 1480 1490 1500 

^ ^ * * ** ** * * ** 

AGCCCTCCAC AGGCATCTCC TATTCAAACA GCAAATGGAG CACCACAATA TGTAGCACTT 
TCGGGAGGTG TCCGTAGAGG ATAAGTTTGT CGTTTACCTC GTGGTGTTAT ACATCGTGAA 
SPP QASP IQT ANG APQY VAL> 
pl5 <- -> p25 

1510 1520 1530 1540 1550 1560 

it it * * * * * * * * * * 

GACCCAAAAA TGGTGTCCAT TTTTATGGAA AAGGCAAGAG AAGGATTAGG AGGTGAGGAA 
CTGGGTTTTT ACCACAGGTA AAAATACCTT TTCCGTTCTC TTCCTAATCC TCCACTCCTT 
DPK MVSI FME KAR EGLG GEE> 

1570 1580 1590 1600 1610 1620 

^ ^ * * * * ** * * ** 

GTTCAGCTAT GGTTTACTGC CTTCTCTGCA AATTTAACAC CTACTGACAT GGCCACATTA 
CAAGTCGATA CCAAATGACG GAAGAGACGT TTAAATTGTG GATGACTGTA CCGGTGTAAT 
VQL WFTA FSA NLT PTDM ATL> 

1630 1640 1650 1660 1670 1680 

* * ** * ★ ** ** ** 
ATAATGGCCG CACCAGGGTG CGCTGCAGAT AAAGAAATAT TGGATGAAAG CTTAAAGCAA 
TATTACCGGC GTGGTCCCAC GCGACGTCTA TTTCTTTATA ACCTACTTTC GAATTTCGTT 
IMA APGC A A D K E I LDES LKQ> 

1690 1700 1710 1720 1730 1740 

it * * * ** ** ** ** 

TTGACGGCAG AGTATGATCG TACCCATCCT CCTGATGGAC CTAGACCATT ACCCTATTTT 
AACTGCCGTC TCATACTAGC ATGGGTAGGA GGACTACCTG GATCTGGTAA TGGGATAAAA 
L T A F Y D R T H P P n R P R P L P Y F> 

p24A 

1750 1760 1770 1780 1790 1800 

it * * * ** ** ** ** 

ACTGCAGCAG AAATTATGGG TATAGGATTA ACTCAAGAAC AACAAGCAGA AGCAAGATTT 
TGACGTCGTC TTTAATACCC ATATCCTAAT TGAGTTCTTG TTGTTCGTCT TCGTTCTAAA 
TAA EIMG IGL TQE QQAE ARF> 
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Figure 1 , continued 

1810 1820 1830 1840 1850 1860 

* * * * ** ** * * ** 

6CACCAGCTA GGATGCAGTG TAGAGCATGG TATCTCGAGG CACTAGGAAA ATTGGCCGCC 
CGTGGTCGAT CCTACGTCAC ATCTCGTACC ATAGAGCTCC GTGATCCTTT TAACCGGCGG 
APA RMQC RAW YLE ALGK LAA> 

1870 1880 1890 1900 1910 1920 

* * * * * * * * * * * * 

ATAAAAGCTA AGTCTCCTCG AGCTGTGCAG TTAAGACAAG GAGCTAAGGA AGATTATTCA 
TATTTTCGAT TCAGAGGAGC TCGACACGTC AATTCTGTTC CTCGATTCCT TCTAATAAGT 
I K A KSPR A V Q LRQ GAKE DYS> 

1930 1940 1950 1960 1970 1980 

* * * * * * ** * * ** 

TCCTTTATAG ACAGATTGTT TGCCCAAATA GATCAAGAAC AAAATACAGC TGAAGTTAAG 
AGGAAATATC TGTCTAACAA ACGGGTTTAT CTAGTTCTTG TTTTATGTCG ACTTCAAnC 
SFI DRLF A Q I D Q E QNTA EVK> 

1990 2000 2010 2020 2030 2040 

* * ** ** * * * * * * 

TTATATTTAA AACAGTCATT AAGCATGGCT AATGCTAATG CAGAATGTAA AAAGGCAATG 
AATATAAATT TTGTCAGTAA TTCGTACCGA TTACGATTAC GTCTTACATT TTTCCGTTAC 
L YL KQSL SMA NAN AECK KAM> 

2050 2060 2070 2080 2090 2100 

* * * * ** ** ** ** 

AGCCACCTTA AGCCAGAAAG TACCCTAGAA GAAAAGCTGA GAGCTTGTCA AGAAGTAGGC 
TCGGTGGAAT TCGGTCTTTC ATGGGATCTT CTTTTCGACT CTCGAACAGT TCTTCATCCG 
SHL KPES TLE EKL RACQ EVG> 

2110 2120 2130 2140 2150 2160 

* * ** * * ** * * ** 

TCACCAGGAT ATAAAATGCA ACTCTTGGCA GAAGCTCTTA CAAAAGTTCA AGTAGTGCAA 
AGTGGTCCTA TATTTTACGT TGAGAACCGT CTTCGAGAAT GTTTTCAAGT TCATCACGTT 
SPG YKMQ LLA EAL T K V Q V V Q> 
p25 <- -* plO 

2170 2180 2190 2200 2210 2220 

* * * * * * ** ** *★ 

TCAAAAGGAT CAGGACCAGT GTGTTTCAAC TGT AAAAAA C CAGGACATCT AGCAAAACAG 
AGTTTTCCTA GTCCTGGTCA CACAAAGTTG ACATTTTTTG GTCCTGTAGA TCGTTTTGTC 
SKG SGPV CFN C K K PGHL AKQ> 

2230 2240 2250 2260 2270 2280 
** ** ** ** ★* 

TGTAGAGATG TGAAAAAATG TAATAAATGT GGAAAGCCTG GTCATTTAGC TGCCAMTGC 
ACATCTCTAC ACTTTTTTAC ATTATTTACA CCTTTCGGAC CAGTAAATCG ACGGTTTACG 
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Figure 1 , continued 

CRD V K K C N K C G K P G H L A A K C> 

2290 2300 2310 2320 2330 2340 

* * ** ** * * ** ** 
TGGCAAGGTG GTAAAAAGAA TTCGGGAAAC TGGAAGGCGG GGCGAGCTGC AGCCCCAGTG 
ACCGTTCCAC CATTTTTCTT AAGCCCTTTG ACCTTCCGCC CCGCTCGACG TCGGGGTCAC 
W Q G G K K N S G N W K A G R A A A P V> 

2350 2360 2370 2380 2390 2400 

** ** ** ** * * * * 
AATCAAGTGC AGCAAGCAGT AATGCCATCT GCACCTCCAA TGGAGGAGAG ACTATTGGAT 
TTAGTTCACG TCGTTCGTCA TTACGGTAGA CGTGGAGGTT ACCTCCTCTC TGATAACCTA 
NQV QQAV MPS APP MEER LLO> 

2410 2420 2430 2440 2450 2460 

* * * * * * * * ** ** 
TTATAAATTA TAATAAAGTA GGTACTACTA CAACATTAGA AAAGAGGCCA GAAATACTTA 
AATATTTAAT ATTATTTCAT CCATGATGAT GTTGTAATCT TTTCTCCGGT CTTTATGAAT 

L> 

<r- plO 

* 2470 2480 2490 2500 2510 2520 

* * ** ** * * ** ** 
TATTTGTAAA TGGGTACCCT ATAAAATTTT TATTAGATAC AGGAGCAGAT ATAACAATTT 
ATAAACATTT ACCCATGGGA TATTTTAAAA ATAATCTATG TCCTCGTCTA TATTGTTAAA 

2530 2540 2550 2560 2570 2580 

* * ** * * ** ** ** 
TAAATAGGAG AGATTTTCAA GTAAAAAATT CTATAGAAAA TGGAAGGCAA AATATGATTG 
ATTTATCCTC TCTAAAAGTT CATTTTTTAA GATATCTTTT ACCTTCCGTT TTATACTAAC 

M I> 
-> pol 
ORF1 

2590 2600 2610 2620 2630 2640 

** ** ** ** ** ** 
GAGTAGGAGG AGGAAAGAGA GGAACAAATT ATATCAATGT GCATTTAGAG ATTAGAGATG 
CTCATCCTCC TCCTTTCTCT CCTTGTTTAA TATAGTTACA CGTAAATCTC TAATCTCTAC 
G V G G G K R G T N Y I N V H L E I R D> 

2650 2660 2670 2680 2690 2700 

* * ** ** ** ** ** 
AAAATTATAA GACACAATGT ATATTTGGCA ATGTTTGTGT CTTAGAAGAT AACTCATTAA 
TTTTAATATT CTGTGTTACA TATAAACCGT TACAAACACA GAATCTTCTA TTGAGTAATT 
EN YK TQC I F G NVCV LED NSL> 
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Figure 1, continued 

2710 2720 2730 2740 2750 2760 

* * ** ** ** ** ** 

TACAACCATT ATTAGGGAGA GATAATATGA TTAGATTCAA TATTAGGTTA GTAATGGCTC 
ATGTTGGTAA TAATCCCTCT CTATTATACT AATCTAAGTT ATAATCCAAT CATTACCGAG 
IQPL LGR DNM IRFN I R L V M A> 

2770 2780 2790 2800 2810 2820 

* * * * ** * * * * ** 

AAATTTCTGA CAAGATTCCA ATAGTAAAAG TAAAMTGAA GGATCCAAAT AAAGGACCTC 
TTTAAAGACT GTTCTAAGGT TATCATTTTC ATTTTTACTT CCTAGGTTTA TTTCCTGGAG 
QISD KIP I V K VKMK DPN KGP> 

2830 2840 2850 2860 2870 2880 

* * ** ** ** ** * * 

AAATAAAACA ATGGCCATTA ACAAATGAAA AAATTGAAGC TTTAACAGAA ATAGTAGAAA 
TTTATTTTGT TACCGGTAAT TGTTTACTTT TTTAACTTCG AAATTGTCTT TATCATCTTT 
QIKQ WPL TNE KIEA LTE IV E> 

2890 2900 2910 2920 2930 2940 

* * * * ** ** ** * * 

GACTAGAAAG AGAAGGGAAA GTAAAAAGAG CAGATCCAAA TAACCCATGG AATACACCAG 

CTGATCTTTC TCTTCCCTTT CATTTTTCTC GTCTAGGTTT ATTGGGTACC TTATGTGGTC 
RLER EGK VKR ADPN NPW NTP> 

2950 2960 2970 2980 2990 3000 

** ** ** ** ** ** 

TATTTGCAAT AAAAAAGAAA AGTGGAAAAT GGAGAATGCT CATAGATTTT AGAGAATTGA 
ATAAACGTTA TTTTTTCTTT TCACCTTTTA CCTCTTACGA GTATCTAAAA TCTCTTAACT 
VFAI KKK SGK WRML IDF REL> 

3010 3020 3030 3040 3050 3060 

* * * * * * ** ** ** 

ACAAATTAAC TGAGAAAGGG GCAGAAGTCC AGTTAGGACT CCCTCATCCT GCTGGATTAA 
TGTTTAATTG ACTCTTTCCC CGTCTTCAGG TCAATCCTGA GGGAGTAGGA CGACCTAATT 
NKLT EKG A E V QLGL PHP AGL> 

3070 3080 3090 3100 3110 3120 

* * ** * * ** ** ** 

AAATGAAAAA ACAAGTTACT GTGCTAGATA TAGGAGATGC ATACTTCACT ATTCCCTTGG 
TTTACTTTTT TGTTCAATGA CACGATCTAT ATCCTCTACG TATGAAGTGA TAAGGGAACC 
KMKK QVT VLD IGDA YFT IPL> 

3130 3140 3150 3160 3170 3180 

** * * * * * * ** ** 

ATCCAGACTA TGCTCCCTAT ACTGCATTCA CATTACCTAG AAAGAATAAT GCAGGACCAG 
TAGGTCTGAT ACGAGGGATA TGACGTAAGT GTAATGGATC TTTCTTATTA CGTCCTGGTC 
DPDY APY TAF TLPR KNN AGP> 
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Figure 1, continued 



3190 



l0 3200 3210 3220 3230 3240 

* ** * * * * ** ** 
GGAGGAGATA TGTATGGTGC AGTTTACCAC AGGGGTGGGT TCTAAGCCCA TTGATATATC 
CCTCCTCW ACATACCACG TCAAATGGTG TCCCCACCCA AGATTCGGGT AACTATATAG 
G R R Y V W C S L P Q G W V L S P L I Y> 

3260 3270 3280 3290 3300 



3250 



* 



„ *********** 
AAAGTACTTT AGATAATATA ATACAACCTT TTATTAGACA AAATCCTGAG TTAGATATTT 
TTTCATGAAA TCTATTATAT TATGTTGGAA AATAATCTGT TTTAGGACTC AATCTATAAA 
q S T L D N I I Q P F I R Q N P E L D I> 

3310 3320 3330 3340 3350 3360 

** ** ** ** ** ** 

ATCAATATAT GGATGACATT TATATAGGAT CAAACTTAAG TAAAAAGGAG CATAAAGAAA 
TAGTTATATA CCTACTGTAA ATATATCCTA GTTTGAATTC ATTTTTCCTC GTATTTCTTT 
Y q Y M DDI Y I G S N L S K K E H K E> 

3370 3380 3390 3400 3410 3420 

* * ** ** ** ** * 

AAGTAGAAGA ATTAAGAAM TTGTTATTAT GGTGGGGATT TGAAACCCCG GAAGACAAAT 
TTCATCTTCT TAATTCTTTT AACAATAATA CCACCCCTM ACTTTGGGGC CTTCTGTTTA 
K V E E L R K ILL W W G F E T P E D K> 

3430 3440 3450 3460 3470 3480 

* * ** ** ** ** * 
TACAAGAAGA GCCCCCATAT AAGTGGATGG GCTATGAATT ACATCCATTA ACATGGTCAA 
Itchcttct CGGGGGTATA TTCACCTACC CGATACTTAA TGTAGGTAAT TGTACCAGTT 
LQEE ppy KWM GYEL HPL TWS> 

3490 3500 3510 3520 3530 3540 
*********** 
TACAGCAAAA ACAATTAGAA ATTCCAGAAA GACCCACATT AAATGMCTG CAGAAATTAG 
Itgtcgtttt tgttaatctt TAAGGTCTTT CTGGGTGTAA TTTACTTGAC GTCTTTAATC 
I q q K Q L E I P E R P T L N E L Q K L> 

3550 3560 3570 3580 3590 3600 
** ** ** ** ** * 
CAGGTAAGAT AAACTGGGCC AGTCAAACTA TCCCAGACTT MGTATAAAA GAACTAACTA 
GTCCATTCTA TTTGACCCGG TCAGTTTGAT AGGGTCTGAA TTCATATTTT CTTGATTGAT 
A G K I N W A S Q T I P D L S I K E L T> 

3610 3620 3630 3640 3650 3660 
************ 
ACATGATGAG AGGAGATCAG AAGTTAGACT CAATAAGAGA ATGGACTGTG GAAGCCAAGA 
TGTACTACTC TCCTCTAGTC TTCAATCTGA GTTATTCTCT TACCTGACAC CTTCGGTTCT 
N M M R G D Q K L D SIRE W T V E A K> 
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Figure 1, continued 

3670 3680 3690 3700 3710 3720 

* * ** ** ** ** * * 

GAGAAGTACA AAAAGCTAAG GAAGCTATTG AGATGCAAGC ACAGCTAAAT TATTATGATC 
CTCTTCATGT TTTTCGATTC CTTCGATAAC TCTACGTTCG TGTCGATTTA ATAATACTAG 
R E V Q KAK E A I EMQA QLN YYD> 

3730 3740 3750 3760 3770 3780 

* * ** * * ** ** ** 

CCCACCGAGA ATTATATGCA AAATTAAGTT TAGTGGGACC ACATCAAATA TGTTATCAAG 
GGGTGGCTCT TAATATACGT TTTAATTCAA ATCACCCTGG TGTAGTTTAT ACAATAGTTC 
PHRE LYA KLS L V G P H Q I CYQ> 

3790 3800 3810 3820 3830 3840 

* * * * ** * * * * ** 

TGTATCATAA GAACCCAGM TGTATTTTAT GGTATGGTAA GATGAATAGA CAAAAGAAAA 
ACATAGTATT CTTGGGTCTT ACATAAAATA CCATACCATT CTACTTATCT GTTTTCTTTT 
VYHK NPE CIL WYGK MNR QKK> 

3850 3860 3870 3880 3890 3900 

** ** * * ** ** * ★ 

AGGCAGAAAA TACCTGTGAT ATAGCTCTAA GGGCATGTTA TAAAATAAGA GAAGAATCTA 
TCCGTCTTTT ATGGACACTA TATCGAGATT CCCGTACAAT ATTTTATTCT CTTCTTAGAT 
KAEN TCD I A L RACY KIR EES> 

3910 3920 3930 3940 3950 3960 

* * ** ** * ★ * * ** 

TTATAAGAAT AGGAAAAGAA CCAATATATG AAATACCTAC TTCTAGAGAA GCCTGGGAGT 
AATATTCTTA TCCTTTTCTT GGTTATATAC TTTATGGATG AAGATCTCTT CGGACCCTCA 
I I R I GKE PIY EIPT SRE AWE> 

3970 3980 399 0 4000 4010 4020 

* * * * * * * * ** * * 

CAAATTTAAT TAATTCACCA TATCTTAAGG CCCCACCTCC TGAGGTAGAA TATATCCATG 
GTTTAAATTA ATTAAGTGGT ATAGMTTCC GGGGTGGAGG ACTCCATCTT ATATAGGTAC 
SNLI NSP YLK APPP EVE YIH> 

4030 4040 4050 4060 4070 4080 

* * ** ** ** * * ** 

CTGCTGTGAA TATAAAAAGA GCATTAAGTA TGATAAAAGA TGTTCCAATA CCAGAAGCAG 
GACGACACTT ATATTTTTCT CGTAATTCAT ACTATTTTCT ACAAGGTTAT GGTCTTCGTC 
A A V N IKR ALS MIKD VPI PEA> 

4090 4100 4110 4120 4130 4140 

* * * * ** ** * * ★ * 

AAACGTGGTA TATAGATGGA GGCAGAAAGC TAGGAAAAGC AGCAAAAGCA GCCTATTGGA 
TTTGCACCAT ATATCTACCT CCGTCTTTCG ATCCTTTTCG TCGTTTTCGT CGGATAACCT 
ETWY IDG GRK LGKA AKA AYW> 
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Figure 1, continued 

4150 4160 4170 4180 4190 4200 

* * * * * * * * * * * * 
rATATATAGG GAAGTGGCAA GTAATGGAGT TAGAAGGCAG TAATCAGAAG GCAGAAGTAC 

fcSc mcIcCGH CAmCCTCA ATCTTCCGTC ATTAGTCTTC CGTCTTCATG 
T D T G K W Q V N E LEGS N Q K A E V> 

4210 4220 4230 4240 4250 4260 

* * ********** 
AAfTATTATT ATTGGCATTA AAAGCAGGAT CAGAGGAAAT GAATATTATA ACAGATTCAC 
5S » TTTCGTCCTA GTCTCCTTTA CTTATAATAT TGTCTAAGTG 
q A L L L A L K A G SEEM Nil T D S> 

4270 4280 4290 4300 4310 4320 

* * ********** 

aataTCTTAT AAATATTATT CTTCAACAAC CAGATATGAT GGAGGGAATC TGGCAAGAAG 

hSa ™aa gaagttgttg gtctatacta cctccc™ accgttcttc 

qYVI Nil LQQ PDMM EGI W Q t> 
4330 4340 ^ 4350 ^ 4360 ^ 4370 ^ 4380 

TTTtIgAAgI ATTGGAGAaI AAAACAGCM TATTTATAGA TTGGSTra 6GACATAAAG 

AAAATCTTCT taacctcttt ttttgtcgtt ataaatatct aacccagggt cctgtatttc 

VLEE LEK K T A IFID WVP G H K> 



4390 4400 ^4410 ^ 4420 ^ 4430 ^ 4440 

GTATTCCAGG AAATGAGGAA GTAGATAAGC TTTGTCAAAC AATGATGATA ATAGAAGGGG 
mACTCCTT CATCTATTCG AAACAGTTTG TTACTACTAT TATCTTC C 
G I p G NEE V D K L C Q T M M I I E G> 

4450 4460 ^ 4470 ^ 4480 ^ 4490 ^ 4500 

ATTfTATATT AGATAAAAGG TCAGAAGATG CGGGATATGA TTTATTGGCT GCAAAAGAAA 
5KS AGTCTTCTAC GCCCTATACT AAATAACCGA CGTmcm 

D G I L D K R S E D A G Y D L L A A K E> 

4510 4520 ^ 4530 ^ 4540 ^ 4550 ^ 4560 

TArATTTATT GCCAGGAGAG GTAAAAGTAA TACCAACAGG GGTAAAGCTA ATGCTGCCTA 
mm cggStc CATTTTCATT atgghgtcc CCATTTCGAT TACGACGGAT 
IHLL PGE V K V IPTG V K L MLP> 

4570 4580 4590 4600 4610 4620 

* * ** ** ** ** * 
aapp AfATTG GGGACTAATA ATGGGAAGAA GCTCGATAGG GAGTAAAGGA TTGGATGTAT 
iKE CCCTGATTAT TACCCTTCTT CGAGCTATCC CTCATTTCCT AACCTACATA 
K G H W G L I M G R S S I G S K G L D V> 
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Figure 1, continued 

4630 4640 4650 4660 4670 4680 

*.* ** ** * * * * * * 

TAGGAGGGGT AATAGATGAA GGATATCGAG GTGAAATTGG AGTAATAATG ATTAATGTAT 
ATCCTCCCCA TTATCTACTT CCTATAGCTC CACTTTAACC TCATTATTAC TAATTACATA 
LGGV IDE GYR GEIG VIM I N V> 

4690 4700 4710 4720 4730 4740 

* * ** ** ** ** *★ 

CAAGAAAATC AATCACCTTA ATGGAACAAC AAAAGATAGC ACAATTAATA ATATTGCCTT 
GTTCTTTTAG TTAGTGGAAT TACCTTGTTG TTTTCTATCG TGTTAATTAT TATAACGGAA 
SRKS ITL MEQ Q K I A QLI ILP> 

4750 4760 4770 4780 4790 4800 

* * * * ** * * ** ** 

GTAAACATGA AGTATTAGAA CAAGGAAAAG TTGTAATGGA TTCAGAGAGA GGAGACAAAG 
CATTTGTACT TCATAATCTT GTTCCTTTTC AACATTACCT AAGTCTCTCT CCTCTGTTTC 
CKHE VLE QGK VVMD SER GDK> 

4810 4820 4830 4840 4850 4860 

* * * * ** * * ** * * 

GTTATGGGTC AACAGGAGTA TTCTCCTCTT GGGTTGACAG GATT6AGGAA GCAGAAATAA 
CAATACCCAG TTGTCCTCAT AAGAGGAGAA CCCAACTGTC CTAACTCCTT CGTCTTTATT 
GYGS TGV FSS WVDR IEE AEI> 

4870 4880 4890 4900 4910 4920 

* * * * * * * * ** * * 

ATCATGAAAA ATTTCACTCA GATCCACAAT ACTTAAGGAC TGAATTTAAT TTACCCAAGA 
TAGTACTTTT TAAAGTGAGT CTAGGTGTTA TGAATTCCTG ACTTAAAnA AATGGGTTCT 
NHEK FHS DPQ YLRT EFN LPK> 

4930 4940 4950 4960 4970 4980 

* * ** ** ** ** ** 

TGGTTGCAGA AGAGATAAGA CGAAAGTGCC CTGTATGTAG AATCAGAGGA GAACAAGTGG 
ACCAACGTCT TCTCTATTCT GCTTTCACGG GACATACATC TTAGTCTCCT CTTGTTCACC 
MVAE EIR RKC PVCR I R G EQV> 

4990 5000 5010 5020 5030 5040 

* * ** * * ** ** ** 

GAGGACAATT GAAAATAGGG CCTGGAATAT GGCAAGTGGA TTGCACACAC TTTAATAGTA 
CTCCTGTTAA CTTTTATCCC GGACCTTATA CCGTTCACCT AACGTGTGTG AAATTATCAT 
GGQL KIG PGI WQVD CTH FNS> 

5050 5060 5070 5080 5090 5100 

* * ** ** ** * * ★ * 

AGATAATCAT TGTAGCAGTA CATGTGGAAT CAGGATTTTT ATGGGCACAG ATAATTCCAC 
TCTATTAGTA ACATCGTCAT GTACACCTTA GTCCTAAAAA TACCCGTGTC TATTAAGGTG 
Kill VAV HVE SGFL WAQ I I P> 
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Figure 1, continued 

5110 5120 5130 5140 5150 5160 

* * * * * * * * * * * * 
AGGAGACTGC AGATTGTACA GTCAAGGCTC TTCTGCAACT TATATGTGCT CATAATGTTA 
TCCTCTGACG TCTAACATGT CAGTTCCGAG AAGACGTTGA ATATACACGA GTATTACAAT 
QETA DCT V K A LLQL ICA HNV> 

5170 5180 5190 5200 5210 5220 

************ 
CAGAATTACA AACAGACAAT GGACCAAATT TTAAAAATCA GAAAATGGAA GGTTTATTAA 
GTCTTAATGT TTGTCTGTTA CCTGGTTTAA AATTTTTAGT CTTTTACCTT CCAAATAATT 
tel q TDN GPN FKNQ KME GLL> 

5230 5240 5250 5260 5270 5280 
** ** ****** ** 
ATTTTATGGG AATAAAACAT AAATTAGGGA TACCAGGTAA CCCACAATCA CAGGCATTAG 
TAAAATACCC TTATTTTGTA TTTAATCCCT ATGGTCCATT GGGTGTTAGT GTCCGTAATC 
N F M G I K H K L G I P 6 N P Q S Q A L> 

5290 5300 5310 5320 5330 5340 
** ** ** ** ** ** 

TGGAAAATGC TAATAACACA TTAAAAGCTT GGATTCAAAA ATTCCTACCA GAGACTACCT 
ACCTTTTACG ATTATTGTGT AATTTTCGAA CCTAAGTTTT TAAGGATGGT CTCTGATGGA 
VENA NNT LKA WIQK FLP ETT> 

5350 5360 5370 5380 5390 5400 

* * ** ** ** ** ** 
CTCTGGATAA TGCTCTGGCC CTAGCCCTGT ATAGTCTCAA CTTTAAACAA AGGGGTAGAC 
GAGACCTATT ACGAGACCGG GATCGGGACA TATCAGAGTT GAAATTTGTT TCCCCATCTG 
S L D N ALA L A L Y S L N F K Q R G R> 

5410 5420 5430 5440 5450 5460 

************ 
TAGGAAGGAT GGCCCCTTAT GAATTATACA TACAACAAGA ATCATTAAGA ATACAAGACT 
ATCCTTCCTA CCGGGGAATA CTTAATATGT ATGTTGTTCT TAGTAATTCT TATGTTCTGA 
LGRM APY ELY I Q Q E SLR IQD> 

5470 5480 5490 5500 5510 5520 

* * ** ** ** ** ** 
ATTTTTCGCA GATTCCACAA AAGTTAATGA TGCAGTGGGT GTATTACAAA GATC AAAAAG 
TAAAAAGCGT CTAAGGTGTT TTCAATTACT ACGTCACCCA CATAATGTTT CTAGTTTTTC 
YFSQ IPQ K L M MQWV YYK D Q K> 

5530 5540 5550 5560 5570 5580 

* * ** ** ** ** ** 

ACAAAAAATG GAAGGGACCA ATGAGAGTGG AATATTGGGG ACAAGGATCA GTATTATTAA 
TGTTTTTTAC CTTCCCTGGT TACTCTCACC TTATAACCCC TGTTCCTAGT CATAATAATT 
DKKW KGP MRV EYWG QGS VLL> 
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Figure 1 , continued 

5590 5600 5610 5620 5630 5640 

* * * * * * ** * * * * 

AGGATGAAGA GAAGGGATAT TTTCTTGTAC CTAGGAGACA CATAAGAAGA GTCCCAGAAC 
TCCTACTTCT CTTCCCTATA AAAGAACATG GATCCTCTGT GTATTCTTCT CAGGGTCTTG 
KDEE KGY F L V PRRH I R R VPE> 

5650 5660 5670 5680 5690 5700 

* * ** ** ** ★* * * 

CCTGCACTCT TCCTGAAGGG GATGAGTGAC GAAGATTGGC AGGTAAGTAG AAGACTCTTT 
GGACGTGAGA AGGACTTCCC CTACTCACTG CTTCTAACCG TCCATTCATC TTCTGAGAAA 

MSD EDW QVSR RLF> 
ORF 2 -> 
PCTL PEG DE> 

<- ORF1 

5710 5720 5730 5740 5750 5760 

* * * * * * ** * * ** 

GCAGTGCTCC AAGGAGGAGT ACGTAGTGCT ATGCTATACA TATCTAGACT ACCTCCGGAC 
CGTCACGAGG TTCCTCCTCA TGCATCACGA TACGATATGT ATAGATCTGA TGGAGGCCTG 
AVL QGGV RSA MLY ISRL PPD> 



5770 5780 5790 5800 5810 5820 

* * ** ** ** ** * * 

GAAAGAGAAA GGTATAAAAA AGACTTTAAG AAAAGGCTTT TGGAAAAGGA AACAGGATTC 
CTTTCTCTTT CCATATTTTT TCTGAAATTC TTTTCCGAAA ACCTTTTCCT TTGTCCTAAG 
ERE RYKK DFK KRL LEKE TGF> 

5830 5840 5850 5860 5870 5880 

* * ** * * ** *★ ** 

ATACAGAGAT TAAGAAAAGC GGAAGGAATA AGGTGGAGCT TCCATACTAG AGATTATTAT 
TATGTCTCTA ATTCTTTTCG CCTTCCTTAT TCCACCTCGA AGGTATGATC TCTAATAATA 
IQR LRKA EGI RWS FHTR DYY> 

5890 5900 5910 5920 5930 5940 

** ** ** ** ** ** 

ATAGGATATG TAAGAGAGAT GGTGGCCGGA TCTAGTCTAC CAGATAGTTT AAGACTGTAT 
TATCCTATAC ATTCTCTCTA CCACCGGCCT AGATCAGATG GTCTATCAAA TTCTGACATA 
IGY VREM VAG SSL PDSL RLY> 

5950 5960 5970 5980 5990 6000 

* * ** * * * * * * ** 

ATTTATATAA GCAATCCATT GTGGCACTGG TCATACCGTC CTGGCCTGAC AAATTTTAAT 
TAAATATATT CGTTAGGTAA CACCGTGACC AGTATGGCAG GACCGGACTG TTTAAAATTA 
! Y I SNPL WHW SYR PGLT NFN> 
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Figure 1, continued 

6010 6020 6030 6040 6050 6060 

* * * * * * ** * * ** 

ACA6AATGGC CTTTTGTGAA TATGTGGATA AAGACAGGAT TCATGTGGGA TGATATTGAA 
TGTCTTACCG GAAAACACTT ATACACCTAT TTCTGTCCTA AGTACACCCT ACTATAACTT 
TEW PFVN M W I KTG FMWD D I E> 

607 0 6080 6090 6100 6110 6120 

* * ** * * ** * * ** 
AGCCAGAATA TTTGCAAAGG AGGAGAGATT TCACATGGAT GGGGACCTGG AATGGTGGGA 
TCGGTCTTAT AAACGTTTCC TCCTCTCTAA AGTGTACCTA CCCCTGGACC TTACCACCCT 

SQN ICKG G E I SHG WGPG MVG> 

6130 6140 6150 6160 6170 6180 

* * * * * * ** ** ** 

ATTGTGATAA MGCTTTTAG TTGTGGAGAA AGAAAGATTG AGGCTACTCC TGTAATGATT 
TAACACTATT TTCGAAAATC AACACCTCTT TCTTTCTAAC TCCGATGAGG ACATTACTAA 
I V I K A F S C G E R K I E A T P V M I> 

6190 6200 6210 6220 6230 6240 

* * ** * * ** * * ** 

ATAAGAGGAG AAATAGATCC AAAAAAATGG TGTGGAGATT GTTGGAATTT GATGTGTCTT 
TATTCTCCTC TTTATCTAGG 1 1 1 1 1 H ACC ACACCTCTAA CAACCTTAAA CTACACAGAA 
I r G EIDP KKW CGD CWNL MCL> 

6250 6260 6270 6280 6290 6300 

* * ** * * * * * * * * 
AGGAACTCAC CTCCACAGAC TTTACAAAGA CTTGCTATGT TGGCATGTGG CGTGCCGGCT 
TCCTTGAGTG GAGGTGTCTG AAATGTTTCT GAACGATACA ACCGTACACC GCACGGCCGA 

RN S PPQT LQR LAM LACG VPA> 

6310 6320 6330 6340 6350 6360 

* * ** * * * * * * ** 
AAGGAGTGGC GAGGATGCTG TAATCAACGC TTTGTTTCTC CTTACAGAAC GCCTGCTGAT 
TTCCTCACCG CTCCTACGAC ATTAGTTGCG AAACAAAGAG GAATGTCTTG CGGACGACTA 

KEW RGCC NQR FVS PYRT PAD> 



6370 



6380 6390 6400 6410 6420 



TTGGAGGTCA TTCAATCCAA GCCCAGCTGG AGTCTATTAT GGTCAGGGAG CCTATGAATG 
AACCTCCAGT AAGTTAGGTT CGGGTCGACC TCAGATAATA CCAGTCCCTC GGATACTTAC 
LEV IQSK PSW SLL WSGS L> 

0RF2 <- 



6430 6440 6450 6460 6470 6480 

** ** ★ * ** ** ** 

GAAGACATAC TAACATTATT TAATAAGGTC ACTAAGAAAC TAGAAAAGGA AAAAGCTATC 
CTTCTGTATG ATTGTAATAA ATTATTCCAG TGATTCTTTG ATCTTTTCCT TTTTCGATAG 
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Figure 1, continued 

6490 6500 6510 6520 6530 6540 

* * * * ** ** * * ** 

AGAATATTTG TATTAGCACA TCAATTAGAA AGGGACAAAG TTATTAGATT ACTACAAGGA 
TCTTATAAAC ATAATCGTGT AGTTAATCTT TCCCTGTTTC AATAATCTAA TGATGTTCCT 

6550 6560 6570 6580 6590 6600 

* * ** ******** 

TTAGTTTGGA GACATAGATT TAAGAAACCC CAAACAAAAT ACTGTTTATG TTGGTTCTGT 
AATCAAACCT CTGTATCTAA ATTCTTTGGG GTTTGTTTTA TGACAAATAC AACCAAGACA 

6610 6620 6630 6640 6650 6660 

* * ** ******** 

TGCAAATTCT ACTATTGGCA GTTGCAATCT ACATTATCAA TAACTACTGC TTAGAAATAC 
ACGTTTAAGA TGATAACCGT CAACGTTAGA TGTAATAGTT ATTGATGACG AATCTTTATG 

6670 6680 6690 6700 6710 6720 

* * ** ** ** ** ** 

TTATAATAAT ATTTCATTTG CAACAATAAT TATGGCAGAA GGATTTGCAG CCAATAGACA 
AATATTATTA TAAAGTAAAC GTTGTTATTA ATACCGTCTT CCTAAACGTC GGTTATCTGT 

MAE GFA A N R Q> 
-> ENV 

6730 6740 6750 6760 6770 6780 

* * ********** 
ATGGATAGGA CCAGAAGAAG CTGAAGAGTT ATTAGATTTT GATATAGCAA CACAAATGAA 
TACCTATCCT GGTCTTCTTC GACTTCTCAA TAATCTAAAA CTATATCGTT GTGTTTACTT 

WIG PEE AEEL LDF DIA TQMN> 

6790 6800 6810 6820 6830 6840 

* * ** ** ** ** ** 

TGAAGAAGGG CCACTAAATC CAGGGATGAA CCCATTTAGG GTACCTGGAA TAACAGATAA 
ACTTCTTCCC GGTGATTTAG GTCCCTACTT GGGTAAATCC CATGGACCTT ATTGTCTATT 
EEE PLN PGMN PFR VPG I T D K> 

6850 6860 6870 6880 6890 6900 

* * ********** 

AGAAAAGCAA GACTATTGTA ACATATTACA ACCTAAGTTA CAAGATTTAC GGAATGAACT 
TCTTTTCGTT CTGATAACAT TGTATAATGT TGGATTCAAT GTTCTAAATG CCTTACTTGA 
EKQ DYC NILQ PKL QDL RNEL> 

6910 6920 6930 6940 6950 6960 

* * ** ** ** ** ** 

TCAAGAGGTA AAACTAGAAG AAGGAAATGC AGGTAAGTTT AGAAGGGCAA GATATTTAAG 
AGTTCTCCAT TTTGATCTTC TTCCTTTACG TCCATTCAAA TCTTCCCGTT CTATAAATTC 
QEV KLE EGNA G K F RRA RYLR> 
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Figure 1, continued 



6970 6980 6990 7000 7010 7020 

** ** ** ** ** * * 
ATATTCTGAT GAAAATGTGC TATCTATAGT CTATTTGCTA ATAGGATATC TAAGATATTT 
TATAAGACTA CTTTTACACG ATAGATATCA GATAAACGAT TATCCTATAG ATTCTATAAA 
Y s D E N V L S I V Y L L I G Y L R Y L> 

7030 7040 7050 7060 7070 7080 

** ** ** ** ** ** 
AATAAATCGT AGGAGTTTAG GATCTTTAAG ACATGATATA GACATAGAAA CACCTCAAGA 
TTATTTAGCA TCCTCAAATC CTAGAAATTC TGTACTATAT CTGTATCTTT GTGGAGTTCT 
I N R R S L G S L R H D I DIE T P Q E> 

7090 7100 7110 7120 7130 7140 

** * * ** ** * * * 

GGAATATTAT AGTAATAGTG AAAGGGGTAC CACATTAAAT CAAAAATATG CGAGAAGATG 
CCTTATAATA TCATTATCAC TTTCCCCATG GTGTAATTTA GTTTTTATAC GCTCTTCTAC 
£ Y Y S N S ERGT TLN QKY ARRC> 

7150 7160 7170 7180 7190 7200 



★ * * 



* ** ** * * * * 



TTGTGTTAGC ACACTTATTA TGTATTTAAT TCTTTTTGCA GTAGGCATCT GGTGGGGAGC 
AACACAATCG TGTGAATAAT ACATAAATTA AGAAAAACGT CATCCGTAGA CCACCCCTCG 
CVS TLI MYLI LFA VGI WWGA 

7210 7220 7230 7240 7250 7260 

* * ** ** ** * * * 
TAGAGCACAA GTAGTGTGGA GACTTCCCCC TTTAGTAGTT CCAGTAGAAG AATCAGAAAT 
ATCTCGTGTT CATCACACCT CTGAAGGGGG AAATCATCAA GGTCATCTTC TTAGTCTTTA 
r A Q V V W R L P P L V V P V E E S E I> 

7270 7280 7290 7300 7310 7320 

** ** ** ** ** * 
AATTTTTTGG GATTGTTGGG CACCAGAAGA ACCCGCCTGT CAAGACTTTC TTGGGGCAAT 
JXAAAAAACC CTAACAACCC GTGGTCTTCT TGGGCGGACA GTTCTGAAAG AACCCCGTTA 
I F W DCW APEE PAC QDF LGAM> 

7330 7340 7350 7360 7370 7380 

** ** ** ** ** * 
GATACATCTA AAAGCTAGTA CGAATATAAG TATACAAGAG GGACCTACCT TGGGGAATTG 
SS TTTCGATCAT GCTTATATTC ATATGTTCTC CCTGGATGGA ACCCCTTAAC 
I H L K A S TNIS I Q E GPT LGNW> 

7390 7400 7410 7420 7430 7440 

* *********** 
GGCTAGAGAA ATATGGGGAA CATTATTCAA AAAGGCTACC AGACAATGTA GAAGAGGTAG 
CCGATCTCTT TATACCCCTT GTAATAAGTT TTTCCGATGG TCTGTTACAT CnCTCCATC 
AR E I W 6 TLFK KAT RQC RRGR> 
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Figure 1, continued 

7450 7460 7470 7480 7490 7500 

** * * ******** 

AATATGGAAA AGATGGAATG AAACTATAAC AGGACCATTA GGATGTGCTA ATAACACATG 
TTATACCTTT TCTACCTTAC TTTGATATTG TCCTGGTAAT CCTACACGAT TATTGTGTAC 
I W K RWN ETIT GPL GCA NNTO 

7510 7520 7530 7540 7550 7560 

* ★ *★ * * ** ** ** 

TTATAATATT TCAGTAATAG TACCTGATTA TCAATGTTAT CTAGACCGAG TAGATACTTG 
AATATTATAA AGTCATTATC ATGGACTAAT AGTTACAATA GATCTGGCTC ATCTATGAAC 
Y N I SVI VPDY QCY LDR VDTW> 

7570 7580 7590 7600 7610 7620 

* * ** ** ****** 

GTTACAAGGG AAAGTAAATA TATCATTATG TCTAACAGGA GGAAAAATGT TGTACAATAA 
CAATGTTCCC TTTCATTTAT ATAGTAATAC AGATTGTCCT CCTTTTTACA ACATGTTATT 
LQG KVN ISLC LTG GKM LYNK> 

7630 7640 7650 7660 7670 7680 

* * ** ******** 

ATATACAAAA CAATTAAGCT ATTGTACAGA CCCATTACAA ATCCCACTGA TCAATTATAC 
TATATGTTTT GTTAATTCGA TAACATGTCT GGGTAATGTT TAGGGTGACT AGTTAATATG 
YTK QLS YCTD PLQ I P L INYT 

7690 7700 7710 7720 7730 7740 

* * ** ** ****** 

ATTTGGACCT AATCAAACAT GTATGTGGAA CACTTCACAA ATTCAGGACC CTGAGATACC 
TAMCCTGGA TTAGTTTGTA CATACACCTT GTGAAGTGTT TMGTCCTGG GACTCTATGG 
FGP NQT CMWN TSQ I Q D PEIP> 

7750 7760 7770 7780 7790 7800 

* * * * ** ** ** ** 

AAAATGTGGA TGGTGGAATC AAAGAGCCTA TTATAAAAAT TGTAAATGGG AAAAAACAGA 
TTTTACACCT ACCACCTTAG TTTCTCGGAT AATATTTTTA ACATTTACCC TTTTTTGTCT 
KCG WWN QRAY YKN CKW E K T D> 

7810 7820 7830 7840 7850 7860 

* * ** ** ** ** ** 

TGTAMGTTT CATTGTCAAA GAACACAGAG TCAGCCTGGA ACATGGCTTA GAGCAATCTC 
ACATTTCAAA GTAACAGTTT CTTGTGTCTC AGTCGGACCT TGTACCGAAT CTCGTTAGAG 
V K F HCQ RTQS QPG TWL RAIS> 

7870 7880 7890 7900 7910 7920 

**• * * ** ** ** ** 

GTCATGGAGA CAAAGGAATA GATGGGAATG GAGACCAGAT TTTGAAAGTG AAAAGGTGAA 
CAGTACCTCT GTTTCCTTAT CTACCCTTAC CTCTGGTCTA AAACTTTCAC TTTTCCACTT 
SWR QRN RWEW RPD FES EKVK> 
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Figure 1 , continued 

7930 7940 7950 7960 7970 7980 

* * * * ** ** ** ** 
AATATCTCTA AAGTGTAATA GCACAAAAAA CCTAACCTTT GCAATGAGAA GTTCAGGAGA 
TTATAGAGAT TTCACATTAT CGTCTTTTTT GGATTGGAAA CGTTACTCTT CAAGTCCTCT 

I S L KCN STKN LTF AMR SSGD> 

7990 8000 8010 8020 8030 8040 

* * ** ** ** ** * * 
TTATGGAGAA GTAACGGGAG CTTGGATAGA GTTTGGATGT CATAGAAATA AATCAAAACT 
AATACCTCTT CATTGCCCTC GAACCTATCT CAAACCTACA GTATCTTTAT TTAGTTTTGA 

YQE VTG A W I E FGC HRN KSKL> 

80 50 8060 8070 8080 8090 8100 

* * ** ******** 

TCATGATGAA GCAAGGTTTA GAATTAGATG TAGATGGAAT ATAGGGGAGA ATACCTCACT 

AGTACTACTT CGTTCCAAAT CTTAATCTAC ATCTACCTTA TATCCCCTCT TATGGAGTGA 

H D E A R F R I R C R W N I G E N T S L> 

8110 8120 8130 8140 8150 8160 

** ** ******** 

CATTGATACA TGTGGAAACA CTCAAAATGT TTCAGGGGCA AATCCTGTAG ATTGTACCAT 
GTAACTATGT ACACCTTFGT GAGTTTTACA AAGTCCCCGT TTAGGACATC TAACATGGTA 
IDT CGN TQNV SGA NPV DCTM> 

8170 8180 8190 8200 8210 8220 

************ 
GTATGCAAAT AAAATGTACA ATTGTTCTTT ACAAAACGGG TTTACTATGA AGGTAGATGA 
CATACGTTTA TTTTACATGT TAACAAGAAA TGTTTTGCCC AAATGATACT TCCATCTACT 
VAN KMY NCSL QNG FTM K V D D> 

8240 8250 8260 8270 8280 



8230 
* * 



★ * ** ** ** ** 



CCTTATTATG CATTTCAATA TGACAAAAGC TGTAGAAATG TATAATATTG CTGGAAATTG 
GGAATAATAC GTAAAGTTAT ACTGTTTTCG ACATCTTTAC ATATTATAAC GACCTTTAAC 
LIM HFN MTKA V E M YNI AGNW> 

8290 8300 8310 8320 8330 8340 

* * ** ** ** ** ** 

GTCTTGTACA TCTGACTTGC CACCAACATG GGGGTATATG AATTGTAACT GTACAAATAA 

CAGAACATGT AGACTGAACG GTGGTTGTAC CCCCATATAC TTAACATTGA CATGTTTATT 

SC T SDL P P T W GYM N C N C T N N> 

8350 8360 8370 8380 8390 8400 

** ** ** ** ** ** 
TAGTAATGAT AATACTAGAA TGGCATGTCC TAACAATCAA GGCATCTTAA GGAATTGGTA 
ATCATTACTA TTATGATCTT ACCGTACAGG ATTGTTAGTT CCGTAGAATT CCTTAACCAT 
SND NTR MACP NNQ GIL RNWY> 
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Figure 1 , continued 

8410 8420 8430 8440 8450 8460 

* * ** * * ** * * * * 

TAACCCAGTA GCAGGATTAC GACAATCCTT GGAAAAGTAT CAAGTTGTAA AACAACCAGA 
ATTGGGTCAT CGTCCTAATG CTGTTAGGAA CCTTTTCATA GTTCAACATT TTGTTGGTCT 
NPV AGL RQSL EKY QVV KQPD> 

8470 8480 8490 8500 8510 8520 

* * ** ** ** * * ** 

TTACTTAGTG GTCCCAGGGG AAGTCATGGA ATATAAAACT AGAAGGAAAA GGGCAGCTAT 
AATGAATCAC CAGGGTCCCC TTCAGTACCT TATATTTTGA TCTTCCTTTT CCCGTCGATA 
YLV VPG E V M E YKT RRK RAAI> 

8530 8540 8550 8560 8570 8580 

* * *★ ** * * * * ** 

TCATGTTATG TTAGCTCTTG CAACAGTATT ATCTATGGCC GGAGCAGGGA CGGGGGCTAC 
AGTACAATAC AATCGAGAAC GTTGTCATAA TAGATACCGG CCTCGTCCCT GCCCCCGATG 
HVM LAL ATVL SMA GAG TGAT> 

8590 8600 8610 8620 8630 8640 

** ** ** ** ** ** 

TGCTATAGGG ATGGTAACAC AATATCACCA AGTTCTAGCA ACCCATCAAG AAGCTATTGA 
ACGATATCCC TACCATTGTG TTATAGTGGT TCAAGATCGT TGGGTAGTTC TTCGATAACT 
A I G MVT QYHQ V L A THQ E A I E> 

8650 8660 8670 8680 8690 8700 

* * * * ** ** * * * * 

AAAGGTGACT GAAGCCTTAA AGATAAACAA CTTGAGATTA GTTACATTAG AGCATCAAGT 
TTTCCACTGA CTTCGGAATT TCTATTTGTT GAACTCTAAT CAATGTAATC TCGTAGTTCA 
KVT EAL KINN LRL VTL EHQV> 

8710 8720 8730 8740 8750 8760 

* * * * * * * * ** * * 

ACTAGTAATA GGATTAAAAG TAGAAGCTAT GGAAAAATTT TTATATACAG CTTTCGCTAT 
TGATCATTAT CCTAATTTTC ATCTTCGATA CCTTTTTAAA AATATATGTC GAAAGCGATA 
L V I GLK VEAM EKF LYT A F A M> 

8770 8780 8790 8800 8810 8820 

* * * * ** ** * * * * 

GCAAGAATTA GGATGTAATC AAAATCAATT CTTCTGCAAA GTCCCTCCTG AATTGTGGAT 
CGTTCTTAAT CCTACATTAG TTTTAGTTAA GAAGACGTTT CAGGGAGGAC TTAACACCTA 

n f i g r n n n n f fc k v p p e l w m> 
TM Peptide 

8830 8840 8850 8860 8870 8880 

* * * * * * * * ** ** 

GAGGTATAAT ATGTCTATAA ATCAAACAAT ATGGAATCAT GGAAATATAA CTTTGGGGGA 
CTCCATATTA TACAGATATT TAGTTTGTTA TACCTTAGTA CCTTTATATT GAMCCCCCT 
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Figure 1, continued 
RYN MSI NQTI WNH G N I TLGE> 

8890 8900 8910 8920 8930 8940 

* * * * ** ** ** * * 

ATGGTATAAC CAAACAAAAG ATTTACAACA AAAGTTTTAT GAAATAATAA TGGACATAGA 
TACCATATTG GTTTGTTTTC TAAATGTTGT TTTCAAAATA CTTTATTATT ACCTGTATCT 
U Y N Q T K D L Q Q K F Y E I I M D I E> 



8950 8960 8970 8980 8990 9000 

* * * * ** ** ** * * 

ACAAAATAAT GTACAAGGGA AAAAAGGGAT ACAACAATTA CAAAAGTGGG AAGATTGGGT 
TGTTTTATTA CATGTTCCCT TTTTTCCCTA TGTTGTTAAT GTTTTCACCC TTCTAACCCA 
Q NN VQG KKGI QQL QKW EDWV> 

9010 9020 9030 9040 9050 9060 

* * * * * * * * * * ** 
AGGATGGATA GGAAATATTC CACAATACTT AAAGGGACTA TTGGGAGGTA TCTTGGGAAT 
TCCTACCTAT CCTTTATAAG GTGTTATGAA TTTCCCTGAT AACCCTCCAT AGAACCCTTA 

6 w I G N I PQYL KGL LGG I L G I> 

9070 9080 9090 9100 9110 9120 

* * * * * * ** * * * * 
AGGATTAGGA GTGTTATTAT TAATTTTATG TTTACCCACA TTGGTTGATT GTATAAGAAA 
TCCTAATCCT CACAATAATA ATTAAAATAC AAATGGGTGT AACCAACTAA CATATTCTTT 

G L G V L L L I L C LPT LVD C I R N> 

9130 9140 9150 9160 9170 9180 

* * ** * * ** ** ** 
TTGTATCCAC AAGATACTAG GATACACAGT AATTGCAATG CCTGAAGTAG AAGGAGAAGA 
AACATAGGTG TTCTATGATC CTATGTGTCA TTAACGTTAC GGACTTCATC TTCCTCTTCT 

C I H K I L GYTV I A M PEV EGEE> 

9190 9200 9210 9220 9230 9240 

* * * * ** ** ** * * 
AATACAACCA CAAATGGAAT TGAGGAGAAA TGGTAGGCAA TGTGGCATAT CTGAAAAAGA 
TTATGTTGGT GTTTACCTTA ACTCCTCTTT ACCATCCGTT ACACCGTATA GACTTTTTCT 

I Q P QME LRRN GRQ CGI SEKE> 

9250 9260 9270 9280 9290 9300 

* * ** * * * * ** ★* 
GGAGGAATGA TGAAGTATCT CAGACTTATT TTATAAGGGA GATGCTGTGC TGAGTTCTTC 
CCTCCHACT ACTTCATAGA GTCTGAATAA AATATTCCCT CTACGACACG ACTCAAGMG 

E E> 

a- Em 

9310 9320 9330 9340 9350 9360 

** ** * * ** ** ** 
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Figure 1 , continued 

CCTTTGAGGA AGGTATGTCA TATGAATCCA TTTCAAATCA AATTAAACTA ATAAAGTATG 
GGAAACTCCT TCCATACAGT ATACTTAGGT AAAGTTTAGT TTAATTTGAT TATTTCATAC 

9370 9380 9390 9400 9410 9420 

* * * * ** ** ** * * 

TATTATAAGG TAAAAAGAAA AAAAGACAAA GAAGAAGAAG AAGGAAGAAA GCCTTCAAGA 
ATAATATTCC ATTTTTCTTT TTTTCTGTTT CTTCTTCTTC TTCCTTCTTT CGGAAGTTCT 

9430 9440 9450 9460 9470 9480 

* ★ * * ** * * * * * * 

ATATGATGAC AGCTTTAGAA GATCGCTTTA GAMGCTATT TGGCACAAAT TCTACAACGG 
TATACTACTG TCGAAATCTT CTAGCGAAAT CTTTCGATAA ACCGTGTTTA AGATGTTGCC 

9490 9500 9510 9520 9530 9540 

* * * * ** ** * ★ * * 

GAGACAGTAC AGTGGAATCT GACGATGAAC CTCCTAAAAA AG AAAAAA GG GTGGACTGGG 
CTCTGTCATG TCACCTTAGA CTGCTACTTG GAGGATTTTT TCTTTTTTCC CACCTGACCC 

9550 9560 9570 9580 9590 9600 

* * * * * * ** * * * * 

ATGAGTATTG GGACCCTGAA GAAATAGAAA GAATGCTTAT GGACTAGTGA CTGTTTACGA 
TACTCATAAC CCTGGGACTT CTTTATCTTT CTTACGAATA CCTGATCACT GACAAATGCT 

9610 9620 9630 9640 9650 9660 

* * * * ** ** * * * * 

ACAAATGATA AATGATGGAA ACAGCTGAGC ATGACTCATA GTTAAAGCGC TAGCAGCTGC 
TGTTTACTAT TTACTACCTT TGTCGACTCG TACTGAGTAT CAATTTCGCG ATCGTCGACG 

9670 9680 9690 9700 9710 9720 

* * * * * * * * ** * * 

TTAACCGCAA AACCACATCC TATGTAAAGC TTGCTGATGA CGTATAATTT GCTCCACTGT 
AATTGGCGTT TTGGTGTAGG ATACATTTCG AACGACTACT GCATATTAAA CGAGGTGACA 

9730 9740 9750 9760 9770 9780 

** ** ** ** ** ** 

AAAAGTATAT AACCAGTGCT TTGTGAGACT TCGGGGAGTC TCTCCGTTGA GGACTTTCGA 
TTTTCATATA TTGGTCACGA AACACTCTGA AGCCCCTCAG AGAGGCAACT CCTGAAAGCT 

9790 9800 9810 9820 9830 9840 

** ★* ** ** ** ** 

GTTCTCCCTT GAGGCTCCCA CAGATACAAT AAATATTTGA GATTGAACCC TGTCAAGTAT 
CAAGAGGGAA CTCCGAGGGT GTCTATGTTA TTTATAAACT CTAACTTGGG ACAGnCATA 

9850 9860 9870 9880 9890 

* * ** ** ** ** 

CTGTGTAATC M il l IA CCT GTGAGGTCTC GGAATCCGGG CCGAGAACTT CGCA 
GACACATTAG AAAAAATGGA CACTCCAGAG CCTTAGGCCC GGCTCTTGAA GCGT 
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